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0. Preface 

The theory of large-scale structure is presently one of the most active re- 
search areas in cosmology. The important questions being studied include: 
Did structure form by gravitational instability? What are the nature and 
amount of dark matter? What is the background cosmological model? 
What were the initial conditions for structure formation? It is exciting 
that we can ask these questions seriously, knowing that observational tests 
are rapidly improving. 

Numerous papers and reviews discuss specific theoretical models of large- 
scale structure, or specific theoretical techniques for constructing and ana- 
lyzing models. However, there are few coherent presentations of the basic 
physical theory of the dynamics of matter and spacetime in cosmology. Al- 
though there are now several textbooks in this area, I think there is still 
room for further pedagogical development. My aim in these lecture notes is 
to provide a detailed yet readable introduction to cosmological dynamics. 

Although I gave an evening seminar on N-body techniques for simulating 
large-scale structure, for reasons of length I have excluded that subject 
from these notes. The subject is presented elsewhere (e.g., Hockney & 
Eastwood 1981, Efstathiou et al. 1985, Bertschinger & Gelb 1991, and S. 
White's notes in this volume). Otherwise, these notes generally follow the 
lectures I gave in Les Houches, except that my lecture on Lagrangian fluid 
dynamics has been subsumed into the section on relativistic perturbation 
theory. The former subject is still evolving, and does not seem to be as 
fundamental as the subjects of my other lectures. 

I would like to thank Andrew Hamilton, Lam Hui, Bhuvnesh Jain, 
Chung-Pei Ma, Dominik Schwarz, Uros Seljak, and Simon White for use- 
ful comments and discussion, and Rcnnan Bar-Kana, Chung-Pei Ma, Nick 
Gncdin, and Marie Machacek for correcting several errors in early drafts. 
I am grateful to the organizers and students of the Les Houches Summer 
School for providing the opportunity to present this material. I appreci- 
ate the hospitality of John Bahcall and the Institute for Advanced Study, 
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where much of the writing was done. This work was supported by NASA 
grants NAGW-2807 and NAG5-2816. 



1. Elementary mechanics 

This lecture applies elementary mechanics to an expanding universe. At- 
tention is given to puzzles such as the role of boundary conditions and 
conservation laws. 

1.1. Newtonian dynamics in cosmology 

For a finite, self-gravitating set of mass points with positions r^t) in an 
otherwise empty universe, Newton's laws (assuming nonrelativistic motions 
and no non-gravitational forces) are 

IP 1 -*- » < L1 > 

In the limit of infinitely many particles each with infinitesimal mass pd 3 r, 
we can also obtain gi = g(r,, t) as the irrotational solution to the Poisson 
equation, 

V ■ g = -4nGp(r,t) , Vx S = 0, (1.2) 
which may be written 

g(r, t) = - ( Gp(r', t)^— ^ d 3 r' . (1.3) 
J \r — r'l* 

The Newtonian potential 4>, defined so that g = —dcf>/dr (using partial 
derivatives to indicate the gradient with respect to r), obeys V 2 (/> = AirGp. 

If the mass density p is finite and nonzero only in a finite volume, then 
g (and also <f>) generally converges to a finite value everywhere, with g — > 
as r — > oo. If, however, p remains finite as r — > oo, then <fi diverges and g 
depends on boundary conditions at infinity. 

Consider the dilemma faced by Newton in his correspondence with Bent- 
ley concerning the gravitational field in cosmology (Munitz 1957). What 
is g in an infinite homogeneous medium? If we consider first a bounded 
sphere of radius R, Gauss' theorem quickly gives us g — — (47r/3)Gpr for 
r < R. This result is unchanged as R — > oo , so we might conclude that g is 
well-defined at any finite r. Suppose, however, that the surface bounding 
the mass is a spheroid (a flattened or elongated sphere, whose cross-section 
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is an ellipse) of eccentricity e > 0. In this case the gravity field is nonradial 
(see Binney & Tremaine 1987, §2.3, for expressions). The only difference in 
the mass distribution is in the shell between the spheroid and its circum- 
scribed sphere, yet the gravity field is changed everywhere except at r = 0. 
An inhomogeneous density field further changes g. Thus, the gravity field 
in cosmology depends on boundary conditions at infinity. 

There is an additional paradox of Newtonian gravity in an infinite homo- 
geneous medium: g = at one point but is nonzero elsewhere (at least in 
the spherical and spheroidal examples given above), in apparent violation 
of the Newtonian relativity of absolute space. Newton avoided this prob- 
lem (incorrectly, in hindsight) by assuming that gravitational forces due to 
mass at infinity cancel everywhere so that a static solution exists. 

These problems are resolved in general relativity (GR) , which forces us to 
complicate the treatment of Newtonian gravity in absolute space. First, in 
GR distant matter curves spacetime so that (r,t) do not provide good co- 
ordinates in cosmology. Second, in GR we must specify a global spacetime 
geometry explicitly taking into account distant boundary conditions. 

What coordinates shall we take in cosmology? First note that a ho- 
mogeneous self-gravitating mass distribution cannot remain static (unless 
non-Newtonian physics such as a fine-tuned cosmological constant is added 
to the model, as was proposed by Einstein in 1917). The observed mass 
distribution is (on average) expanding on large scales. For a uniform ex- 
pansion, all separations scale in proportion with a cosmic scale factor a{t). 
Even though the expansion is not perfectly uniform, it is perfectly reason- 
able to factor out the mean expansion to account for the dominant motions 
at large distances as in Figure 1. We do this by defining comoving coordi- 
nates x and conformal time r as follows: 



The starting time for the expansion is r = and t = when a = 0; if this 
time was nonexistent (or ill-defined in classical terms) then we can set the 
lower limit of integration for r(t) to any convenient value. Although the 
units of a are arbitrary, I follow the standard convention of Peebles (1980) 
in setting a — 1 today when t = t and t = r . A radiation source emitting 
radiation at t < To has redshift AA/A = z = — 1 + a" 1 where Ao is the 
rest wavelength. 

For a perfectly uniform expansion, the comoving position vectors x re- 
main fixed for all particles. For a perturbed expansion, each particle follows 
a trajectory x (r) [or x (t)]. The comoving coordinate velocity, known also 




(1.4) 
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Fig. 1. Perturbed Hubble expansion. 

as the peculiar velocity, is 

dx dr , , , . 

where = dlna/dt = a~ 2 da/dr is the Hubble parameter. Note that 

v is the proper velocity measured by a comoving observer at x 7 i.e., one 
whose comoving position is fixed. 

[The distinction between "proper" and "comoving" quantities is impor- 
tant. Proper quantities are physical observables, and they do not change 
if the expansion factor is multiplied by a constant. Thus, v = dx/dr = 
(adx)/(adt) is a proper quantity, while dx/dt is not. This is why I prefer 
r rather than t as the independent variable.] 

We shall assume that peculiar velocities are of the same order at all dis- 
tances and in all directions, consistent with the choice of a homogeneous 
and isotropic mean expansion scale factor. These assumptions are consis- 
tent with the Cosmological Principle, which states that the universe is 
approximately homogeneous and isotropic when averaged over large vol- 
umes. In general relativity theory, the Cosmological Principle is applied 
by assuming that we live in a perturbed Robertson- Walker spacetime. Lo- 
cally, the GR description is equivalent to Newtonian cosmology plus the 
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boundary conditions that the mass distribution is (to sufficient accuracy) 
homogeneous and isotropic at infinity. 

Unless otherwise stated, in this and the following lectures (until section 
4) I shall use 3- vectors for spatial vectors assuming an orthonormal basis. 
Thus, A ■ B = AiBi = A l Bi = A l B % with summation implied from i = 1 
to 3. Note that Ai — A 1 are Cartesian components, whether comoving 
or proper, and they are to be regarded (in this Newtonian treatment) as 
3-vectors, not the spatial parts of 4-vectors. (If we were to use 4-vectors, 
then Ai = gijA* — a 2 A 1 in a Robertson- Walker spacetime. Because we are 
not using 4-vectors, there is no factor of a 2 distinguishing covariant and 
contravariant components.) This treatment requires space to be Euclidean, 
which is believed to be an excellent approximation everywhere except very 
near relativistic compact objects such as black holes and, possibly, on scales 
comparable to or larger than the Hubble distance c/H. (In section 4 the 
restrictions to Cartesian components and Euclidean space will be dropped.) 
Also, gradients and time derivatives will be taken with respect to the co- 
moving coordinates: V = d/dx, '= d/dr. 

Before proceeding further we must derive the laws governing the mean ex- 
pansion. Consider a spherical uniform mass distribution with mass density 
p and radius r = xa(t) with x = constant. Newtonian energy conservation 
states 

1 fdr\ 2 GM 

2 \di 

implying 



dlna\ 2 /%0 87r„ , , 

— — = (aB) — — Gap — K , K = -2Ex~ 2 . (1.6) 

UT J 3 

This result, known as the Friedmann equation, is valid (from GR) even if 
p includes relativistic particles or vacuum energy density p vac = A/(8nG) 
(where A is the cosmological constant). The cosmic density parameter is 
57 = 8irGp/ (3H 2 ), so the Friedmann equation may also be written K = 
(il — l)(aH) 2 . Homogeneous expansion, with a = a(r) independent of 
x, requires K = constant in addition to Vp = 0. In GR one finds that 
K is related to the curv atur e of space (i.e., of hypersurfaces of constant 



t). The solutions of eq. (1^) for zero-pressure (Friedmann) models, two- 
component models with nonrelativistic matter and radiation, and other 
simple equations of state may be found in textbooks (e.g., Padmanabhan 
1993, Peebles 1993) or derived as good practice for the student. 

At last we are ready to describe the motion of a nonuniform medium 
in Newtonian cosmology with mass density p(x, r) = p(r) + 5p{x, r). We 
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start from Newton's law in proper coordinates, cPr /dt 2 = g, and transform 
to comoving coordinates and conformal time: 

d 2 x a dx d f a\ _ , f . . . (x — x') . 

We eliminate the homogeneous terms (those present in a homogeneous 
universe) as follows. First, assuming that the universe is, on average, 
spherically symmetric at large distance, the first term on the right-hand 
side becomes (from Gauss' theorem) — (4ir/3)Ga 2 px. (This is where the 
boundary conditions at infinity explicitly are used.) To get the term pro- 
portional to x on the left-hand side, differentiate the Friedmann equation: 
(a / d)d{a / a) / dr — (4irG /3)d(pa 2 ) / dr . For nonrelativistic matter, p oc a~ 3 , 
implying d(pa 2 )/dT — —dpa, so d(a/a)/dT = — (4ir/'3)Ga 2 p. (If p includes 
relativistic matter, not only is dp/dr changed, so is the gravitational field. 
Our derivation gives essentially the correct final result in this case, but 
its justification requires GR.) We conclude that the homogeneous terms 
cancel, so that the equation of motion becomes 

d 2 x a dx 9 f . . . x (x — x'), % . _ ,, 

+ aTr - J S «*> T) ]x-^ ^ S " V0 ' 

where 

4/{x,T) = -Ga> [ Sp V> T)d \ X ' ■ 
J \x-x'\ 

Note that <p' is a proper quantity: a 2 d 3 x' j\x — x'\ ~ d 3 r/\r — r'\. 

If J Spd 3 x — > when the integral is taken over all space — as happens if 
the density field approaches homogeneity and isotropy on large scales, with 
p being the volume-averaged density — then (/)' is finite and well-defined 
(except, of course, on top of point masses, which we ignore by treating the 
density field as being continuous). Newton's dilemma is then resolved: we 
have no ambiguity in the equation of motion for x(t). We conclude that 
</>', sometimes called the "peculiar" gravitational potential, is the correct 
Newtonian potential in cosmology provided we work in comoving coordi- 
nates. Therefore we shall drop the prime and the quaint historical adjective 
"peculiar." In summary, the equations of motion become 

^ + «^ = _ V ^, V 2 = 4nGa 2 5p(x,T) . (1.7) 
dr z a dr 

As we shall see in section 4, the same equations follow in the weak-field 
(| 0| <C c 2 ), slow- motion (v 2 <C c 2 ) limit of GR for a perturbed Robertson- 
Walker spacetime. If Newton had pondered more carefully the role of 
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boundary conditions at infinity, he might have invented modern theoretical 
cosmology! 

1.2. Lagrangian and Hamiltonian formulations 

The equations of Newtonian cosmology may be derived from Lagrangian 
and Hamiltonian formulations. The latter is particularly useful for treat- 
ments of phase space. 

In the Lagrangian approach, one considers the trajectories x(t) and the 
action S^ce^t)]. From elementary mechanics (with proper coordinates and 
no cosmology, yet), S = J Ldt with Lagrangian L — T — W = \mv 2 — m<f> 
for a particle moving in a potential <f) (T is the kinetic energy and W is 
the gravitational energy). We now write a similar expression in comoving 
coordinates, bearing in mind that the action must be a proper quantity: 



J L(x, x, r)dr , L 



—mv 
2 



m(j) 



(1.8) 



where x = v is the peculiar velocity. We will show that eq. ( |1.8| ) is the 
correct Lagrangian by showing that it leads to the correct equations of 
motion. 

Equations of motion for the trajectories follow from Hamilton's principle: 
the action must be stationary under small variations of the trajectories 
with fixed endpoints. Thus, we write x(t) — > x(t) + Sx(t), dx/dr — > 
dx/dr + (d/ dr)5x{T). The change in the action is 



6S = 



|^ • Sx 
ox 



9L 
dx 



d 



dL 
dx 

dL 
dx 



dr 



■ Sx(t) dr , 



where we have integrated by parts assuming (dL/dx) -5x = at r = t\ and 
T2- Applying Hamilton's principle, 6S = 0, we obtain the Euler-Lagrange 
equation (it works in cosmology, too!): 

d fdL\ dL _ 
dr \ dx J dx 



(1.9) 



The reader may verify that substituting L from eq. (1.8) yields the correct 
equation of motion (1.7). 

It is straightforward to extend this derivation to a system of self- 
gravitating particles filling the universe. The Lagrangian is 



W 



(1.10) 
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where the total gravitational energy excludes the part arising from the 
mean density: 

W=^[}~" m,0, -a 3 p (f) d A x 




E 



Gi 



1 3 




Ga 2 p / , , (1.11) 



where the factor = is introduced to avoid double-counting pairs of particles. 
For a continuous mass distribution we obtain 

1 /"jii_„sja y ._ 1 n„5 /"j3„ 6 P (xx,t)6p(x 2 ,t) 



W = - N)5pa 6 d 6 x= — Ga b d A x x d 6 x 2 ry , ' rv , ; .(1.12) 
2 7 2 J J \xi-x 2 \ 

In the Hamiltonian approach one considers the trajectories in the single- 
particle (6-dimensional) phase space, {x(t),p(t)}. The aim is to obtain 
coupled first-order equations of motion for x(t) and p(r), known as Hamil- 
ton's equations, instead of a single second-order equation for x(r). 

The derivation of Hamilton's equations has several steps. First we need 
the canonical momentum conjugate to x: 

dL dx 
p = — = amv = am — . (1.13) 
y dx dr y ' 

Note that p is not the proper momentum measured by a comoving observer: 
mv is. In Hamiltonian mechanics, one must use the conjugate momentum 
and not the proper momentum. 

The next step is to eliminate dx/dr from the Lagrangian in favor of 
p. We then transform from the Lagrangian to a new quantity called the 
Hamiltonian, using a Legendre transformation: 

L(x,x,t) — > H(x,p,r) = p ■ x — L . (1-14) 



Notice that we transform L to H and x to p (the latter through eq. 1 . 1 3| ) 



Why do we perform these transformations? The answer is that now Hamil- 
ton's principle gives the desired equations of motion for the phase-space 
trajectory {x(t),p(t)}. In phase space, Hamilton's principle says that the 
action S = J L dr = J(p ■ x — H) dr must be stationary under indepen- 
dent variations of all phase space coordinates: x(t) — * x(t) + Sx(t) and 
p(t) — * p(r) + &p(t). As an exercise, the reader can show, using a method 
similar to the derivation of the Euler-Lagrange equation above, 

dx dH dp dH M ^ 
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provided that p ■ Sx = at the endpoints of r. 

In our case, H = p 2 /(2am) + am<f) (getting the a's right requires using 
the Legendre transformation), yielding 

dx p dp 

— — = , — = —amvcj) ■ (1.16) 

dtr am dr 



These equations could be combined to yield eq. (1.7), but in the Hamilto- 
nian approach we prefer to think of two coupled evolution equations. This 
is particularly useful when studying the evolution of a system in phase 
space, as we shall do in section 3 with hot dark matter. 



1.3. Conservation of momentum and energy? 

Are total momentum and energy conserved in cosmology? This is a non- 
trivial question because the canonical momentum and Hamiltonian differ 
from the proper momentum and energy. 

Consider first the momentum of a particle in an unperturbed Robertson- 
Walker universe. With no perturbations, <f> — so that Hamilton's equation 
for p becomes dp/ dr = -amV^ = 0, implying that the canonical momen- 
tum p is conserved. But, the proper momentum mv — a _1 p measured by a 
comoving observer decreases as a increases. What happened to momentum 
conservation? 

The key point is that v — dx/dr is measured using a non-inertial (ex- 
panding) coordinate system. Suppose, instead, that we choose v to be a 
proper velocity measured relative to some fixed origin. Momentum conser- 
vation then implies v = constant (if V0 = 0, as we assumed above). At r = 
Ti and T2, the particle is at x\ and X2, respectively. Because dx/dr gives the 
proper velocity relative to a comoving observer at the particle's position, at 
n we have dx/dr = v— (d/a)iX\, while at T2, dx/dr = v— (0/0)2X2- (The 
proper velocity relative to the fixed origin is v in both cases, but the Hubble 
velocity at the particle's position — the velocity of a comoving observer — 
changes because the particle's position has changed.) Combining these, we 
find [x(t-2)-x(t-l)]/(t2-t{) w -(o/a)[x(r 2 ) -x(ti)]/(t 2 -n) + 0(r 2 -n) 
or, in the limit T2 — t\ — > 0, d 2 x/dr 2 = —(a/a)dx/dr. This is precisely 
our comoving equation of motion in the case = 0. Thus, the "Hubble 
drag" term (a/a)dx/dr is merely a "fictitious force" arising from the use 
of non-inertial coordinates. Stated more physically, the particle appears to 
slow down because it is continually overtaking faster moving observers. 

Energy conservation is more interesting. Let us check whether the Hamil- 
tonian H(x,p,r) is conserved. Using Hamilton's equations for a single 
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particle, we get 

dr dx dr dp dr 9r 9t 

Using H = p 2 /(2am) + am<f) 7 we obtain dH/dr = — (a/a)(p 2 /2am) + 
md(a(j))/dT which is nonzero even if dip/dr — 0. Is this lack of energy 
conservation due to the use of non-inertial coordinates? While the appear- 
ance of a Hubble-drag term may suggest this is the case, if we wish to 
obtain the total Hamiltonian (or energy) for a system of particles filling all 
of space, we have no choice but to use comoving coordinates. 

Perhaps the Hamiltonian is not conserved because it is not the proper 
energy. To examine this possibility, we use the Hamiltonian for a system of 
particles in comoving coordinates, with H = a(T + W). The proper kinetic 
energy (with momenta measured relative to comoving observers) is 



while the gravitational energy W is given in eq. (1-11). Holding fixed 
the momenta, we see that a 2 T is a constant, implying d(aT)/dr = —aT. 
Similarly, holding fixed the particle positions, we find that a<f> is a con- 
stant, implying d(aW)/dr = 0. We thus obtain the Layzer-Irvine equation 
(Layzer 1963, Irvine 1965) 



Total energy (expressed in comoving coordinates) is not conserved in 
Newtonian cosmology. (This is also the case in GR — indeed, there is 
generally no unique scalar for the total energy in GR.) However, if almost 
all of the mass is in virialized systems obeying the classical virial theorem 
2T + W w 0, we recover approximate total energy conservation. 

2. Eulerian fluid dynamics 

2.1. Cosmological fluid equations 

A fluid is a dense set of particles treated as a continuum. If particle 
collisions are rapid enough to establish a local thermal equilibrium (e.g., 
Maxwcll-Boltzmann velocity distribution), the fluid is an ideal collisional 
gas. If collisions do not occur (e.g., a gas of dark matter particles), the gas 
is called collisionless. (I exclude incompressible fluids, i.e., liquids, from 




(1.18) 




(1.19) 
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consideration because the gases considered in cosmology are generally very 
dilute and compressible.) The fluid equations discussed in this lecture ap- 
ply only for a collisional gas (or a pressureless collisionless gas). They 
apply, for example, to baryons (hydrogen and helium gas) after recombi- 
nation, to cold dark matter before trajectories intersect ("cold dust"), and 
(with relativistic corrections) to the coupled photon-baryon fluid before 
recombination. 

I shall assume a nonrelativistic gas and ignore bulk electric and mag- 
netic forces. These are not difficult to add, but the essential physics of 
cosmological fluid dynamics does not require them. 

The fluid equations consist of mass and momentum conservation laws and 
an equation of state. Mass conservation is represented by the continuity 
equation. In proper coordinates (r, t) this is 

3p 9 dr , 

We convert to comoving coordinates r = j dt/a(t), x = r/a(t), being care- 
ful to transform the partial derivatives as follows: d/dt = (9r/9t)9/9r + 
(dx/dt) ■ d/dx, 9/9r = a~ 1 d/dx = a -1 V. We also rewrite the density and 
velocity by factoring out the mean behavior: 

dv 

p = p(l + S), —=Hr + v (2.2) 
where v = dx/dr is now the peculiar velocity. The reader may easily show 



that eq. (2.1) becomes 

^- + V-[(l + S)v]=0. (2.3) 
9r 

Momentum conservation for an ideal fluid is represented by the Euler 
equation (Landau & Lifshitz 1959). It is most simply obtained by adding 
the pressure-gradient force to the equation of motion for a freely-falling 



mass element, eq. (1.7). In comoving coordinates, we find 



dv a _ , 1_ ,„ „. 

— + -v = -V<f) Vp . (2.4) 

dr a p 

The time derivative is taken along the fluid streamline and is known as the 
convective or Lagrangian time derivative: 

-f = f + u-V. (2.5) 
dr or 

Closing the fluid equations requires an evolution equation for the pressure 
or some other thermodynamic variable. Perhaps the most natural is the 
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entropy. For a collisional gas, thermodynamics implies an equation of 
state p = p(p, S) where S is the specific entropy. For example, for an ideal 
nonrelativistic monatomic gas, for reversible changes we have 



which says that the heat input to a fluid element equals the change in 
thermal energy plus the pressure work done by the clement, i.e., energy is 
conserved. Combining this with the ideal gas law p = pk^T j p where p, is 
the mean molecular mass and ks is the Boltzmann constant, we obtain 



The equation of state must be supplemented by an evolution equation for 
the specific entropy. Outside of shock waves, the entropy evolution equation 
is 



where T and A are, respectively, the proper specific heating and cooling 
rates (in erg g _1 s _1 ). They are determined by microphysical processes 
such as radiative emission and absorption, cosmic ray heating, Compton 
processes, etc. For the simplest case, adiabatic evolution, T = A = 0. For 
a realistic non-ideal gas, it may be necessary to evolve the radiation field, 
the ionization fraction, and other variables specifying the equation of state. 

The fluid equations are much harder to solve than Newton's laws for 
particles falling under gravity, for several reasons. First, they arc non- 
linear partial differential equations rather than a set of coupled ordinary 
differential equations. Second, shock waves (discontinuities in p, p, S, and 
v) prevent intersection of fluid elements. These discontinuities must be 
resolved (on a computational mesh or otherwise) and followed stably and 
accurately. Finally, heating and cooling for realistic gases are complicated 
and can lead to large temperature or entropy gradients that are difficult to 
resolve. An example of the latter is the sun, whose temperature changes 
by about 15 million K in a distance that is minuscule compared with cos- 
mological distance scales. 

Computational fluid dynamics is a difficult art but is important for 
galaxy formation. I shall not summarize the numerical methods here but 
refer the reader instead to the literature (e.g., Sod 1985, Lcvcque 1992, 
Monaghan 1992, Bryan et al. 1994, Kang et al. 1994). 

Some of the most important effects of gas pressure can be gleaned 
from linear perturbation theory, in which we linearize the fluid equations 




(2.6) 




(2.7) 




(2.8) 
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about the uniform solution for an unperturbed Robertson- Walker space- 
time. This technique is useful for checking for gravitational and other 
linear instabilities. Moreover, the linearized fluid equations may provide 
a reasonable description of large-scale, small-amplitude fluctuations in the 
(dark+luminous) matter, even if structure is nonlinear on small scales. 
This is a common assumption in large-scale structure theory. It is sup- 
ported reasonably well by numerical simulations. 
Linearizing the continuity and Euler equations gives 

5 + V-vKiO, v + -v w -V<f)- -Vp , (2.9) 
a p 

where an overdot denotes 3/3r. The pressure gradient may be obtained 
from the equation of state p = p{p, S) . For an ideal nonrelativistic 
monatomic gas, 

Ivp = ^+|TV5, cl = \v_. (2.10) 

p 6 dp 

Finally, we must linearize the entropy evolution equation. If the time scale 
for entropy cha nges is long compared with the acoustic or gravitational 



time scales, eq. (2.S) becomes dS/dr w 0. For the small peculiar velocities 
of linear perturbation theory this reduces to S « 0. 

There are five fluid variables (p, S, and three components of v), hence 
five linearly independent modes. The general linear perturbation is a linear 
combination of these, which we now proceed to examine. 

2.2. Linear instability 1: isentropic fluctuations and Jeans criterion 

We begin with some nomenclature from thermodynamics. Isentropic 
means VS = 0: the same entropy everywhere. Adiabatic means 
dS/dr = 0: the entropy of a given fluid element does not change. The 
two concepts are distinct. It is common in cosmology to say "adiabatic" 
when one means "isentropic." This usage is confusing and I shall adopt 
instead the standard terminology from thermodynamics. 

Isentropic fluctuations are the natural outcome of quantum fluctuations 
during inflation followed by reheating: rapid particle interactions in ther- 
mal equilibrium eliminate entropy gradients. If VS 1 = 0, the linearized 
fluid and gravitational field equations are 

5 + V • v = , v + -v = -V<p - c 2 s VS , V 2 4> = 4irGpa 2 6 . (2.11) 

a 

Combining these gives a damped, driven acoustic wave equation for 5: 

6 + -S = 4TrGpa 2 6 + c 2 V 2 6 . (2.12) 
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Aside from the Hubble damping and gravitational source terms, this equa- 
tion is identical to what one would get for linear acoustic waves in a static 
medium. 

To eliminate the spatial Laplacian we Fourier transform the wave equa- 
tion. For one plane wave, 5(x, r) — > 6(k, r) exp(ife • x). The wave equation 
becomes 

5 + -8 = (AirGpa 2 - k 2 c 2 s ) S = (k 2 - k 2 ) c 2 s 6 , (2.13) 
where we have defined the comoving Jeans wavenumber, 
/4vrGpa 2x 1/2 



Neglecting Hubble damping (by setting a = 1), the time dependence of 



the solution to eq. (2.13) would be 5 oc exp(— ilut), yielding a dispersion 
relation very similar to that for high-frequency waves in a plasma, but with 
an important sign difference because gravity is attractive: 

uj 2 = cj 2 + k 2 c 2 -> uj 2 = -wj + k 2 c 2 . (2.15) 

The plasma frequency is lu p = (47rn e e 2 /m e ) 1 / 2 while the Jeans frequency is 
lu,i = fcjc B = {AnGp) 1 / 2 . Whereas electromagnetic waves with uJ 2 < ui 2 do 
not propagate (k 2 < implies they are evanescent, e.g., they reflect off the 
Earth's ionosphere), gravitational modes with k < fcj are unstable (u> < 
0), as was first noted by Jeans (1902). In physical terms, pressure forces 
cannot prevent gravitational collapse when the sound-crossing time A/c s is 
longer than the gravitational dynamical time (Gp)^ 1 ^ 2 for a perturbation 
of proper wavelength A = 2-na/k. 

Including the Hubble damping term slows the growth of the Jeans insta- 
bility from exponential to a power of time for k <C fcj. In general there is one 
growing and one decaying solution for i5(/c,r); these are denoted S±(k,r). 
For c 2 = and an Einstein-de Sitter (flat, matter-dominated) background 
with a(r) oc r 2 , 5 + oc t 2 and oc r -3 . For k 3> fcj, we obtain acoustic 
oscillations. In a static universe the acoustic amplitude for an adiabatic 
plane wave remains constant; in the expanding case it damps in general. 
An important exception is oscillations in the photon-baryon fluid in the 
radiation-dominated era; the amplitude of these oscillations is constant. 
(Showing this requires generalizing the fluid equations to a relativistic gas, 
a good exercise for the student.) In any case, acoustic oscillations suppress 
the growth relative to the long- wavelength limit. 

It is interesting to write the linear wave equation in terms of (f> rather 



Cosmological Dynamics 



19 



than S using V 
c 2 < c 2 ): 



4irGa 2 pS cx a 5 for nonrelativistic matter (with 



la^_ 3 
2 a 2 2 V 



■ + fc 2 c s 2 = , 



(2.16) 



where we used the Friedmann equation (1.6); recall that K = (17— l)(ai?) 2 
is the spatial curvature constant. In a matter-dominated universe, differen- 
tiating the Friedmann equation gives a/a—(\/2)a 2 /a 2 = —{1/2)K, yielding 



3-4>+ (k 2 c 2 - 2K) (j> = . (2.17) 



a 

When written in terms of the gravitational potential rather than the den- 
sity, the wave equation loses its gravitational source term. 



The solutions to eq. ( 2.17 ) depend on the time-dependence of the sound 
speed as well as on the background cosmology. To get a rough idea of the 
behavior, consider the evolution of the potential in an Einstein-de Sitter 
universe filled with an ideal gas. For a constant sound speed, the solutions 
are 

(f> + (k,r) = T~ 2 j 2 (kc B T) , (t)-(k,r) = T- 2 y 2 (kc s T) , c s = const. , (2.18) 

where j 2 and y 2 arc the spherical Bcssel functions of the first and second 
kinds of order 2. Although simple, this is not a realistic solution even 
before recombination (in that case, the photons and baryons behave as 
a single tightly-coupled relativistic gas, and relativistic corrections to the 
fluid equations must be added), except insofar as it illustrates the generic 
behavior of the two solutions: (damped) oscillations for kc s r ^> 1 and 
power-law behavior for kc s r <C 1. 

An alternative approximation, valid after recombination, is to assume 
that the baryon temperature roughly equals the photon temperature (this 
is a reasonable approximation because the small residual ionization ther- 
mally couples the two fluids for a long time even though there is negligible 
momentum transfer), c 2 = Cg s a _1 where co s is a constant. In this case the 
solutions are powers of r: 



- /, s „ -5 ± y/25 - 4(fcc 0s r ) 2 _! 
9±{k,T)=T , n= 5! ; T gas cx a . (2.19) 

The solutions oscillate for fcc s ro > 5/2 and they damp for fcc s To > 0. 

In both of our solutions, and indeed for any reasonable equation of state 
in an Einstein-de Sitter universe, long-wavelength (fcc s r <^ 1) growing den- 
sity modes have corresponding potential <j)+ = constant, while the decay- 
ing density modes have 4>- cx J a~ 3 d,T. The density perturbation and 
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potential differ by a factor of pa 2 oc a~ 1 from the Poisson equation. If 
K < or k 2 c 2 > 0, then <f>+ decays with time, although 6+ still grows. 
Note that the important physical length scale where the transfer func- 
tion <p + [k, t) /(f>(k, 0) falls significantly below unity is the acoustic comoving 
horizon distance c s r, not the causal horizon distance ct or the Hubble dis- 
tance c/ H. Setting c s to the acoustic speed of the coupled photon-baryon 
fluid at matter-radiation equality gives the physical scale at which the bend 
occurs, c s r eq , in the power spectrum of the standard cold dark matter and 
other models. 



2.3. Linear instability 2: entropy fluctuations and isocurvature mode 
Entropy gradients act as a source term for density perturbation growth. 



Using eq. (2.10) and repeating the derivation of the linear acoustic equa- 
tion, we obtain (for c 2 <C c 2 ) 

S + -5- 4nGpa 2 5 - c 2 V 2 5 = -TV 2 S . (2.20) 
a 3 

For adiabatic evolution, S = 0, so what counts is the initial entropy gradi- 
ent. Entropy gradients may be produced in the early universe by first-order 
phase transitions resulting in spatial variations in the photon/baryon ratio 
or other abundance ratios. If there were no entropy gradients present be- 
fore such a phase transition, then the entropy variations can only have been 
produced by nonadiabatic processes. (This may explain the "adiabatic vs. 
isocurvature" nomenclature used by some cosmologists.) In practice, these 
entropy fluctuations are taken as initial conditions for subsequent adiabatic 
evolution. 



Equation (2.20) is not applicable to the early universe because it assumes 
the matter is a one-component nonrelativistic gas. However, the behavior 
of its solutions are qualitatively similar to those for a relativistic multi- 
component gas and so its analysis is instructive. 

The isocurvature mode is given by the particular solution of density 
perturbation growth having 5 = 5 = but V 2 S at some early ini- 
tial time Tj. The initial conditions may be regarded as a perturbation in 
the equation of state in an otherwise unperturbed Robertson- Walker (con- 
stant spatial curvature) spacetime, accounting for the name "isocurvature." 
Variations in entropy at constant density correspond to variations in pres- 
sure, which lead through adiabatic expansion to changes in the density. 
Therefore, initial entropy fluctuations seed density fluctuations. 
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The solution to eq. (2.2C ) is obtained easily in Fourier space using the 

fcions S±(k, t): 

5 + {k,r) f a'T'5'_dT' 



source-free (isentropic) solutions S±(k,r): 
2 



Ss(Kt) = -^k 2 S(k) 



5-(k,r) / a'T'S'+dr' 



(2.21) 



where primes are used to indicate that the variables are evaluated at r = r'. 
We see that both growing and decaying density perturbations are induced. 
After the source (aT<5_) becomes small, the density fluctuations evolve the 
same way as isentropic fluctuations — e.g., they oscillate as acoustic waves 
if kc s T 1. To reinforce the point about nomenclature made earlier, 
I note that in our approximation, both isocurvature and "adiabatic" (i.e., 
isentropic) modes are adiabatic in the sense of thermodynamics: 5 = after 
the initial moment. For a realistic multi-component gas the evolution is not 
truly adiabatic, but that is a complication we shall not consider further. 
In the literature, modes are described as being adiabatic or isocurvature 
depending only on whether the initial density is perturbed with negligible 
initial entropy perturbation, or vice versa. 

2.4- Vorticity — or potential flow? 

With the growing and decaying isentropic perturbations, and the isocurva- 
ture mode, we have accounted for three of the expected five linear modes. 
The remaining two degrees of freedom were lost when we took the di- 
vergence of the Euler equation, thereby annihilating any transverse (rota- 
tional) contribution to v. We consider them now. 

Theorem: Any differentiable vector field v(x) may be written as a sum 
of longitudinal (curl-free) and transverse (divergence-free) parts, v\\ and 
Vj_, respectively: 

v(x) = v\\(x) + v±(x) , V x r>|| = V • v ± = . (2.22) 

The proof follows by construction, by solving V • V\\ =6 and V x v± — lo 
where 9 = V • v and to = V x v. In a flat Euclidean space, solutions are 
given by 

iw = £/w£_^. = ( 2 - 23 ) 

t7 X (sb) = — I w(sb') x [ X ~ X l d 3 x' , lo(x) = V x v . (2.24) 

47T / \X — X'\ 6 
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Note that this decomposition is not unique; we may always add to vu a 
curl-free solution of V • vn =0 and to v± a divergence-free solution of 
V x »i = ( e -g-7 constant vectors). With suitable boundary conditions 
(e.g., J vu d 3 x = when integrated over all space) this freedom can be 
eliminated. The variables 9 and uj are called the (comoving) expansion 
scalar and vorticity vector, respectively. 

In our preceding discussion of perturbation evolution we have implicitly 
considered only The remaining two degrees of freedom correspond to 
the components of v± (the transversality condition V • v± =0 removes 
one degree of freedom from this 3- vector field). Fortunately, we can get a 
simple nonlinear equation for v± — actually, for its curl, uj — by taking 
the curl of the Euler equation: 

Co + -uj = V x (v x lo) + p~ 2 (Vp) x (Vp) 

= V x (v x uj) + §T(V In p) x (VS) (2.25) 

where we have assumed an ideal monatomic gas in writing the second form. 
The term arising from entropy gradients is called the baroclinic term. It 
is very important for the dynamics of the Earth's atmosphere and oceans 
(Pcdolsky 1987). 

An important general result follows from eq. (2.25), the Kelvin Circu- 
lation Theorem: If uj = everywhere initially, then uj remains zero (even 
in the nonlinear regime) if the baroclinic term vanishes. (We are assuming 
that other torques such as magnetic ones vanish too.) The reason for the 
importance of this result in cosmology is that many models assume irro- 
tational, isentropic initial conditions. With adiabatic evolution, it follows 
that uj = 0. Such a flow is also called potential flow because the velocity 
field may then be obtained from a velocity potential: v = v\\ = — V$ v . 

Nonadiabatic processes (heating and cooling) and oblique shock waves 
can generate vorticity. In a collisionlcss fluid, if the fluid velocity is defined 
as the mass- weighted average of all the mass elements at a point, this 
averaging behaves like entropy production in regions where trajectories 
intersect, and so vorticity can be generated in the mean (fluid) velocity 
field. Vorticity also arises from isocurvature initial conditions. Equation 
(2.21) implies 8s oc V 2 ^ for long wavelengths in the linear regime, giving 
a baroclinic torque proportional to Vc>s x VS oc V(V 2 S) x VS, which 
is nonzero in general (though it appears only in second-order perturbation 
theory) . 

For most structure formation models, vorticity generation is quite small 
until shocks form (or trajectories intersect, for collisionless dark matter). 
In this case, one may obtain the velocity potential from the line integral of 
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v ■ dl . 



(2.26) 



o 



Taking the path to be radial with the observer in the middle allows one 
to reconstruct the velocity potential, and therefore the transverse velocity 
components, from the radial component. This idea underlies the potential 
flow reconstruction method, POTENT (Bertschinger & Dekel 1989). If the 
(smoothed) density fluctuations are sufficiently small for linear theory to 
be valid, we can estimate the density fluctuation field from an additional 
divergence. If pressure is unimportant, so k <C fcj and 5 oc 6 + (t), the 
linearized continuity equation gives 



For a wide range of cosmological models, dlnS+/dlna = f(fl) w ft - 6 
depends primarily on the mass density parameter and weakly on other 
cosmological parameters (Peebles 1980, Lahav et al. 1991). Thus, com- 
bining measurements of v (radial components from galaxy redshifts and 
distances) and independent measurements of S (from the galaxy density 
field plus an assumption about how dark matter is distributed relative to 
galaxies) allows estimation of ft (Dekel et al. 1993). A review of the PO- 
TENT techniques and results is given by Dekel (1994). 

3. Hot dark matter 

The previous lecture studied the evolution of an ideal collisional gas in- 
cluding gravity and pressure. A gas of neutrinos, or of collisionlcss dark 
matter particles, behaves differently. In this lecture we investigate the evo- 
lution of a nonrelativistic collisionlcss gas whose particles have significant 
thermal speeds. (Rclativistic kinetic theory is discussed by Stewart 1971, 
Bond & Szalay 1983, and Ma & Bertschinger 1994b.) An example is the 
gas of relic thermal neutrinos that decoupled at a temperature k^T <~ 1 
MeV in the early universe. The present number density of these neutrinos 
(about 113 cto~ 3 for each of the three flavors) is such that a single massive 
type contributes m v c 2 / (93 h 2 eV) to fi, where h = H /(100 kins" 1 Mpc -1 ). 
Massive neutrinos are called hot dark matter because their thermal speeds 
significantly affect the gravitational growth of perturbations. 

Before working out the detailed equations of motion for hot dark matter, 
it is useful to consider in general terms the effect of a thermal distribution. 




(2.27) 
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Suppose we have a cold gas with no thermal motions. In this case it doesn't 
matter whether the gas is collisional or collisionlcss: gravitational instabil- 
ity amplifies the growing mode of irrotational density perturbations. What 
happens when we add thermal motions? We know the answer for a colli- 
sional gas: pressure stabilizes collapse for wavelengths less than the Jeans 
length, the distance sound waves travel in one gravitational dynamical time. 
For collisionlcss particles we also expect suppression. However, a collision- 
less gas cannot support sound waves, because no restoring force is provided 
by particle collisions. 

A perfect collisional gas is fully described by its mass (or energy) density, 
fluid velocity, and temperature as functions of position. All other properties 
follow from the fact that the phase space density distribution is (locally) 
the thermal equilibrium distribution, e.g. Maxwcll-Boltzmann. This is not 
true for a collisionlcss gas, whose complete description requires specifying 
the full phase space density. 

For a collisionlcss gas, the velocity distribution function may be far from 
Maxwellian, so that the spatial stress tensor is not the simple diagonal form 
appropriate for an ideal gas. Instead there may be significant off-diagonal 
terms contributing shear stress that acts like viscosity in a weakly col- 
lisional fluid: it damps relative motions. We expect perturbations in a 
collisionless gas to be damped for wavelengths shorter than the distance 
traveled by particles with the characteristic thermal speed during one gravi- 
tational collapse time, the collisionlcss analogue of the Jeans length. Stated 
simply, overdense or underdense perturbations decay because the particles 
fly away from them at thermal speeds. This collisionless damping process 
is called free-streaming damping. 

The characteristic thermal speed of massive neutrinos after they become 
nonrelativistic is 

v th = Mil = 50.4(1 + z) {m v c 2 /eV)- x kms" 1 (3.1) 

where we have used the standard big bang prediction T v — (4/ll) 1 / 3 T 7 
(e.g., Kolb & Turner 1990) with T 7 w 2.735 K today. Multiplying v th 
by the gravitational time (inGpa 2 )" 1 / 2 gives the comoving free-streaming 
distance, 

A & = 0.41 {VLh 2 )- 1 ' 2 (1 + z) 1 ' 2 {m^/cV)- 1 Mpc . (3.2) 

At any time, fluctuations with wavelength less than about Af s are damped; 
much longer wavelength fluctuations grow with negligible suppression. 

The free-streaming distance does not really grow without bound asz-^ 
oo because the neutrino thermal speed cannot exceed c. Applying this limit 
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gives a maximum comoving free-streaming distance of 

A fs , ma x = 31.8 (Qh 2 )- 1/2 (m,c 2 /eV)- 1/2 Mpc . (3.3) 

Thus, unless they are regenerated by perturbations in other components 
(as happens, for example, in a model with hot and cold dark matter), 
primeval density fluctuations in massive neutrinos with wavelength smaller 
than this rather large scale will be erased by free-streaming damping. A 
more quantitative treatment is presented below using the actual evolution 
equations for the neutrino phase space density distribution. 



3.1. Tremaine-Gunn bound 



Before treating the phase space evolution, we discuss another important 
consequence of finite neutrino thermal speed: high-speed neutrinos cannot 
be tightly packed into galaxy halos. This fact can be used to place a lower 
bound on the neutrino mass if neutrinos make up the dark matter in galaxy 
halos (Tremaine & Gunn 1979). 

The initial phase space density for massive neutrinos is a relativistic 
Fermi-Dirac distribution (preserved from the time when the neutrinos de- 
coupled in the early universe): 

2h~ 3 
exp(pc/fc B io) + 1 



where p is the comoving canonical momentum of eq. (1.13), hp is Planck's 
constant (with a subscript to distinguish it from the scaled Hubble con- 
stant), and To = aT v is the present neutrino "temperature." The decrease 
of T v with time is compensated for by the factor a relating proper mo- 
mentum to comoving momentum. Ignoring perturbations, the present-day 
distribution for massive neutrinos is the relativistic Fermi-Dirac — not the 
equilibrium nonrelativistic distribution — because the phase space distri- 
bution was preserved after neutrino decoupling. 

Tremaine & Gunn (1979) noted that because of phase mixing (discussed 
further below), the maximum coarse-grained phase space density of mas- 
sive neutrinos today is less than the maximum of fo(p), hp 3 . If massive 
neutrinos dominate the mass in galactic halos, this must be no less than 
the phase space density needed for self-gravitating equilibrium. This bound 
can be used to set a lower limit on the neutrino mass if one assumes that 
the neutrinos constitute the halo dark matter. 

Although the neutrino mass bound is somewhat model-dependent be- 
cause the actual coarse-grained distribution in galactic halos is unknown, 
we can get a reasonable estimate by assuming an isothermal sphere: a 



26 



E. Bertschinger 



Maxwell-Boltzmann distribution with constant velocity dispersion a 2 (at 
a = 1 so that there is no distinction between proper and comoving): 

f(r,p) = (27rmla 2 )- 3 / 2 n(r) exp (^|-^) . (3.5) 

In a self-gravitating system there are a family of spherical density profiles 
p(r) = m v n{r) obeying hydrostatic equilibrium: 



1 dP _ GM(< r) _ AttG 
p dr r 2 r 



2 



r 2 p(r) dr . (3-6) 



o 



The simplest case is the singular isothermal sphere with p oc r~ 2 ; the reader 
can easily check that p = a 2 / (27rGr 2 ). Imposing the phase space bound at 
radius r then gives 

m v > (2tt)- 5 / 8 (Gh 3 P ar 2 y 1/4 . (3.7) 

Up to overall numerical factors, this is the Tremaine-Gunn bound. 

The singular isothermal sphere is probably a good model where the ro- 
tation curve produced by the dark matter halo is flat, but certainly breaks 
down at small radius. Because the neutrino mass bound is stronger for 
smaller or 2 , the uncertainty in the halo core radius (interior to which the 
mass density saturates) limits the reliability of the neutrino mass bound. 

For the Local Group dwarf galaxies in Draco and Ursa Minor, measure- 
ments of stellar velocity dispersions suggest a is a few to about 10 km s 
(Pryor & Kormendy 1990). If these galaxies have isothermal halos at r = 1 
kpc, the crude bound of eq. ( |3.7| ) implies m v is greater than a few eV. 

3.2. Vlasov equation 

We now present a rigorous treatment of the evolution of perturbations in a 
nonrelativistic collisionless gas, based on the evolution of the phase space 
distribution. The single-particle phase space density f(x,p,r) is defined 
so that fd 3 xd 3 p is the number of particles in an infinitesimal phase space 
volume element. We shall use comoving spatial coordinates x and the 



associated conjugate momentum p = amx (eq. 1.13). Note that d 3 xd 3 p = 
m 3 d 3 rd 3 v is a proper quantity so that / is the proper (physical) phase 
space density. 

If the gas is perfectly collisionless, / obeys the Vlasov (or collisionless 
Boltzmann) equation of kinetic theory, 

Df 9/ dx 9f dp df . . 

Dt or dr ox dr op 
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This equation expresses conservation of particles along the phase space tra- 



jectory {x(t),p(t)}. Using Hamilton's equations (1.16) for nonrelativistic 
particles, we obtain 

3/ , P 3/ 3/ 

— H amV</> ■ — = . (3.9) 

or a77i oa; op 

The Vlasov equation is supposed to apply for the coarse-grained phase 
space density for a collisionless gas in the absence of two-body correlations 
(Ichimaru 1992). Often, however, the statistical assumptions underlying 
the use of the Vlasov equation are vague. To clarify its application we 
digress to present a derivation using the Klimontovich (1967) approach to 
kinetic theory. 

Consider one realization of a universe filled with particles following phase 
space trajectories {xi(T),Pi(r)} (i labels the particles). The exact single- 
particle phase space density (called the Klimontovich density) is written 
by summing over Dirac delta functions: 

f(x,p, t)=J2 *[x ~ *i(r)] S\p - p,-(t)] . (3.10) 

i 

No statistical averaging or coarse-graining has been applied; / is the fine- 
grained density for one universe. This phase space density obeys the 
Klim ontovich (1967) equation, which is of exactly the same fo rm a s eq. 
(3.8) . T he proof follows straightforwardly from substituting eq. ( |3.1C ) into 



eq. | 

The Klimontovich density retains all information about the microstate 
of a system because it specifies the trajectories of all particles. This is far 
too much information to be practical. We must reduce the information 
content by performing some averaging or coarse-graining. This averaging 
is taken over a statistical ensemble of microstates corresponding to a given 
macrostate — for example, microstates with the same phase space density 
averaged over small phase space volumes containing many particles on av- 
erage. We denote the averages using angle brackets (), without being very 
precise about the ensemble adopted for the coarse-graining. 

The discreteness effects of individual particles are accounted for by the 
s-particle distribution functions (s = 1, 2, . . .) f s , which are defined using 
a standard cluster expansion: 

(f(x,p,r)) = (^2S(x - Xi) 6(p - pi) \ = fi(x,p,r) , (3.11) 
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(f(x ly p U T) f(x 2 ,p 2 ,r)) = 

S ( Xl ~ S ( Pl ~ Pi) S ( X 2 ~ x i)S{P2 -Pi)j + 

l^Z 6<yXi ~ x ^> s ( pi ~ p ^ ^ X2 ~ x ^> 6<yP2 ~ p ^ i ( 3 - 12 ) 

= 6(xi- x 2 )S(p 1 -p 2 )fx(xi,pi,T) + f 2 (x 1 ,p 1 ,X 2 ,P2,T) , 

and so on. We further write f 2 as a sum of uncorrelated and correlated 
parts, 

f 2 (xi,pi,x 2 ,p 2 ,r) = fx{xx,Px,T)fx{x 2 ,p 2 ,T)+f 2c {xx,px,x 2 ,p 2 ,T) .(3.13) 

This equation defines f 2c , known in kinetic theory as the irreducible two- 
particle correlation function. If there are no pair correlations in phase 
space, f 2c = 0. 

We now ensemble-average the Klimontovich equation, recalling that it 
is identical to eq. (3.9) provided we use the Klimontovich density. If <j> is 
a specified external potential, neglecting self-gravity, we see that fx obeys 
the Vlasov equation. However, if <f> is computed self-consistently from the 
particles, the mV0- (3/ /dp) term is quadratic in the Klimontov ich d ensity, 
yielding an additional correlation term from eqs. (3.12) and ( 3.13 ) after 
coarse-graining. This term is not present in the Vlasov equation. 

The contribution to the gravity field from the particles is (cf. eq. 1.11) 



-Vcj)(x,T) = J d 3 x' d 3 p' f(x',p',T) 



+ ^" a V dV F^F' (3 ' 14) 

where the second term, required in comoving coordinates, removes the 
contribution from the mean uniform background. 

Combining our results now yields the exact kinetic equation for the one- 
particle phase space density fx- 

dfx , P 3/i dfx 

— 1 • amV(p ■ —— 

c! v am ox op 

Gm 2 J d 3 x'd 3 p> fr~^ 3 • ^/ ac (s, P, p\ r) (3.15) 



{x - x') 

It 1 — n?' 3 
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where — is given by eq. (3.14) using fi for /, and adding any other 
contribution from other sources. Equation (3.15) is called the first BBGKY 
hierarchy equation (Peebles 1980, Ichimaru 1992). It differs from the Vlasov 
equation by a correlation integral term. 

If there are no phase space correlations, as would occur if we had a 
smooth collisionless fluid, then the one-particle or coarse-grained distri- 
bution obeys the Vlasov equation of kinetic theory. Correlations may be 
introduced by gravitational clustering, which couples /2c to f\. One may 
derive an evolution equation for ]i c — the second BBGKY hierarchy equa- 
tion — by averaging /9//3t, but it involves fs c , and so on. The result is 
an infinite hierarchy of coupled kinetic equations, the BBGKY hierarchy. 

For some cases, Boltzmann's hypothesis of molecular chaos may hold, 
implying fac = except at binary collisions, with the right-hand side 
of eq. (3.15) becoming a Boltzmann collision operator. Fortunately, for 
the particles of interest here — neutrinos — the gravitational (and non- 
gravitational, after neutrino decoupling) collision time is so long that the 
correlation integral is completely negligible. Thus, hot dark matter com- 
posed of massive neutrinos obeys the Vlasov equation after decoupling. 
From now on we shall drop the subscript 1 from /. 

We now return to our main line of development to discuss phase mixing. 
The Vlasov equation implies conservation of phase space density, but a 
given initial volume d 3 xd 3 p evolves in a complicated way (i.e., the trajec- 
tories of particles initially inside this volume may be highly complicated). 
Consider the initial phase space element shown in Figure 2a, extracted from 
a one-dimensional iV-body simulation. Figures 2b and 2c show the phase 
space distribution at a later time, with each particle's trajectory evolved 
according to Hamilton's equations without (Fig. 2b) and with (Fig. 2c) 
gravity, respectively. In both cases the area dxdp of the phase space element 
is identical to the initial area as a consequence of the Vlasov equation. 

Figure 2c illustrates the process known as phase mixing: the phase space 
structure becomes highly convoluted as particles make multiple orbits. Re- 
gions of initially high phase space density can end up entwined with regions 
of initially low phase space density. Although the density is conserved along 
each phase space trajectory, if the distribution is coarse-grained (averaged 
over finite phase space volume) , the resulting coarse-grained density is not 
conserved. The maximum coarse-grained density can only decrease, as we 
noted previously in the discussion of the Tremaine-Gunn bound. 

The process of phase- mixing is complicated, and the only practical means 
of integrating the Vlasov equation for such an evolved collisionless system 
is by TV-body simulation: the phase space is sampled with discrete particles 
at some initial time and the particle trajectories are computed, providing 
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Fig. 2. Phase space evolution, (a) Initial conditions, (b) Evolved state without gravity, 
(c) Evolved state with gravity. 

a sample of the evolved phase space. However, analytical methods can be 
used while the phase space distribution is only slightly perturbed from the 
homogeneous equilibrium distribution. These methods, presented in the 
next two subsections, will help us to understand free-streaming damping 
in detail. 



3.3. Nonrelativistic evolution in an external gravitational field 

In this section we consider hot dark matter made of nonrelativistic massive 
neutrinos with Vl v <C ft so that their self-gravity is unimportant. The 
gravitational potential (f>(x,r) (using comoving coordinates) is assumed to 
be given from other sources such as cold dark matter in a mixed hot and 
cold dark matter model. 

We can solve the Vlasov equation ( |3.9| ) approximately by replacing 
df/dp with the unperturbed term d fa/dp. This approximation is valid 
for | / — /o| -C /oj and should suffice to demonstrate the collisionless damp- 
ing of small-amplitude fluctuations. 

A quadrature solution of the Vlasov equation can be obtained provided 
that we change the time variable from t to s — J dr/a — J dt/a 2 and then 
Fourier transform the spatial variable: 

f(x,p,r(s)) = J d 3 ke* k x f(k,p,s) . (3.16) 



The gravitational potential cj> is transformed similarly. Integrating eq. (3.9) 
over s, we obtain the solution 
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f(k,p, s) = f(k,p,Si)e 



— ik-u(s — Si ) 



-im 



k-^fi-) f ds' a 2 (s'U(k,s')e- ik - u ^- s ^ , (3.17) 



where u — p/m and Si is an initial time. If the initial phase space distribu- 
tion is unperturbed, then f(k,p,Si) — /o(p)<5(fc). Note that the complex 
exponentials in eq. (3.17) correspond to the propagation of the phase space 
density along the characteristics dx/ds = u. This motion is called free- 
streaming. 

To understand the behavior of the free-streaming solution, let us examine 
the integral term of eq. (3.17), which is proportional to 

/ 1 dya 2 (y + Sl )^(k,y + Sl )e-^y , (3.18) 
Jo 

where [3 = k-p/m and y — s' — s,-. For sufficiently slowly moving neutrinos, 
(3 is small enough so that j3y <C 1. This condition corresponds to a free- 
streaming distance along k that is much less than k . These neutrinos do 
not move far from the crests and troughs of the plane wave perturbation. 
Neglecting the exponential, the time dependence of the solution is the same 
as for cold dark matter. 

If, however, fly 3> 1, corresponding to neutrinos traveling across many 
wavelengths of a perturbation, the rapid oscillations of the exponential 



lead to cancellation in the integrand of eq. ( 3.18 ) and suppression of the 
neutrino phase space density perturbation. This effect, known as free- 
streaming damping, occurs because neutrinos that are initially at the crests 
or troughs of density waves move so far that they distribute themselves 
almost uniformly. The small gravitational acceleration induced by the ex- 
ternal potential is inadequate to collect the fast-moving neutrinos in dense 
regions. 

Thus, perturbations can grow only for the neutrinos that move less than 
about one wavelength per Hubble time. Our analysis confirms the rough 
picture we sketched in the beginning of this lecture. 

We can obtain the net density perturbation (in Fourier space) by inte- 
grating eq. (3.17) over momenta: 



n„(fc, s) _ 1 



J d 3 P f(k,p,s) 

k{s - s') 



n Q n Q 
= 5(k)-k 2 I ds a 2 (s')ci>(k : s')(s- s')F 



, (3.19) 
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where no = J d 3 p /o (p) is the mean comoving number density and F is the 
Fourier transform — with respect to the momentum! — of the unperturbed 
distribution function: 

F(q) = - fcPpe-^foip) . (3.20) 
n-0 J 

For the relativistic Fermi-Dirac distribution appropriate to hot dark matter, 
F has the series representation (Bertschinger & Watts 1988) 

3C(3)^ V ^ (n 2 +«- 



where £(3) = 1.202 ... is the Riemann zeta function and F(0) = 1. 

Equation (3.19) does not give much insight into free-streaming damping. 
To get a better feel for the physics, as well as a simpler approximation 
for treating hot dark matter, we now show how to convert eq. (3.19) into 
a differential equation for the evolution of the hot dark matter density 



perturbation similar to eq. (2.12) for a perfect collisional fluid. This may 
seem impossible a priori — how can the dispersive behavior of a collisionless 
gas be represented by fluid-like differential equations? — but we shall see 
that it is possible if we approximate fo(p) by a form differing slightly from 
the Fermi-Dirac distribution. The results, although not exact, will give us 
additional insight into the behavior of collisionless damping. 

The first step is to rewrite eq. (3.19) for the Fourier transform of the 
density fluctuation 5 V : 

6Jk,s) = -km [ ds a 2 (s')4>(k,s')[qF(q)} , q = ^ - - . (3.22) 

m 

Next, we differentiate twice with respect to the time coordinate s: 

= ~fc 2 f ds' a 2 {s') $(k, s') ^lqF(q)} , (3.23) 



|% =-k 2 a\s)Uk,s) 

f ds ' a 2( s ') 0( fcj s >) ^[qF(q)} . (3.24) 
ni J Si uq 

Note the appearance of a non-integrated source term in the second deriva- 
tive, arising because d(qF)/dq does not vanish at s = s' (q = 0) while qF 
does. 
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Next, we note that if d 2 (qF)/dq 2 were to equal a linear combination of 
d(qF)/dq and (qf), then we could write the integral in equation (3.24) as a 
linear combination of dS u /ds and 8 U . Unfortunately, this is not the case for 



F(q) given by eq. (3.21). However, it is true for the family of distribution 



functions whose Fourier transforms are 

F^q) =exp(-7« 7 p ) , (3.25) 

for any dimcnsionlcss constant 7. This defines the family of phase space 
density distributions 

AM = "»/(§! 't<«> = ^ (1 + ■ <»■») 

For this form of unperturbed distribution we have 



(<zF 7 ) = -2 7Po —(qF y ) - ( 7 Po)V-y . (3.27) 
Combining eqs. fl3~22])-(3.24) and ( p7| ), we get 



-— r + 2 1 „— 5 V = -k a (s)4>(k,s) . (3.28) 

os z m os m 

To put this result into a form similar to the acoustic wave equation we 
derived for a collisional fluid, we define the characteristic proper thermal 
speed 

_ k B T„ 7p ,„ on ^ 

c v = 7 = . (3.29) 

mc ma 

Next, we change the time variable from s back to r with dr/ds = a. 
Finally, we assume that the source term gravitational potential (f> is given 
by the Poisson equation for a perturbation S c in a component with mean 
mass density p c (e.g., cold dark matter — recall that we are neglecting the 
self- gravity of the neutrinos). Dropping the hat on 5 U , the result is 

^ + 2fcc„^ 5 V + k 2 c 2 J v = 4nGa 2 p c S c . (3.30) 

This equation was first derived by Setayeshgar (1990). It is approximate 
(not exact) for the linear evolution of massive neutrinos because we replaced 
the Fermi- Dirac distribution by eq. ( |3.26| ). It is not difficult to show 
that eq. ( 3.26| ) is the only form of the distribution function for which 



eq. (3.17) can be reduced to a differential equation for <5„(fc,r). (Even 
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the Maxwell-Boltzmann distribution fails — a collisionless gas with this 
distribution initially does not evolve the same way as a collisional gas with 
the Maxwell-Boltzmann distribution function for all times.) One should 
also bear in mind that 8 V does not contain all the information needed to 
characterize perturbations in a collisionless gas (Ma & Bertschinger 1994a). 
Complete information resides in f(Jk,p, s). 



Even if eq. (3.3C) is not exact for massive neutrinos and does not fully 
specify the perturbations, it provides an extremely helpful pedagogic guide 
to the physics of collisionless damping. We see at once that a gravitational 
source can induce density perturbations in a collisionless component, but 
the source competes agains acoustic (& 2 c^) and damping (a/a+2kc u ) terms. 
Roughly speaking, hot dark matter behaves like a collisional gas with an 
extra free-streaming damping term. 

Does the k 2 c 2 term imply that a collisionless gas can support acoustic 
oscillations? To check this we consider the limit kc v T > 1 so that the 
Hubble damping and gravitational source terms are negligible. We then 
have 

5 V + 2lj v 5 v + w v 8 u « , u v — kc v . (3.31) 
Because ui v changes very slowly with time compared with the oscillation 



timescale ui 1 , eq. ( 3.31 ) is a linear differential equation with constant 



coefficients and is easily solved to give the two modes 

5 V oc Te- UvT or e _avr , uj v t > 1 . (3.32) 

Neither solution oscillates! The first one begins to grow but is rapidly 
damped on a timescale w^ 1 , after the typical neutrino has had time to 
cross one wavelength. 

Because the damping time {kc^ 1 is proportional to the wavelength, 
short- wavelength perturbations are damped most strongly. At any given 
time r, perturbations of comoving wavelength less than about c„r are at- 
tenuated. This is precisely the free-str eam ing distance we introduced in 
the beginning of this lecture, equation (^^). 

Our results enable us to understand why the hot dark matter trans- 
fer function is similar to that of cold dark matter for long wavelengths 
but cuts off sharply for short wavelengths (Bond & Szalay 1983). During 
the radiation-dominated era, a(r) cx r. While the massive neutrinos were 
relativistic, c v w c was constant. The comoving free-streaming distance 
increased, c u t cx a, with hot dark matter perturbations being erased on 
scales up to the Hubble distance. After the neutrinos became nonrelativis- 
tic, however, c„ is given by eq. ( |3.29 ), c v oc a -1 . Thus, the free-streaming 
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distance saturates at the Hubble distance when the neutrinos become non- 
relativistic. During the matter-dominated era, a(r) oc t 2 (while f2 ~ 1) 
so that the free-streaming distance decreases: c„t oc a -1 / 2 . However, free- 
streaming has already erased the hot dark matter perturbations on scales 
up to the maximum free-streaming distance, eq. ( |3.3| ). Only if the pertur- 
bations are re-seeded, e.g. by cold dark matter or topological defects, will 
small-scale power be restored to the hot dark matter. 

3.4- Nonrelativistic evolution including self-gravity 

Now that we have developed the basic techniques for solving the linearized 
nonrelativistic Vlasov equation, adding self-gravity of the collisionless par- 
ticles is easy. We simply add a contribution to (f> arising from <5„. In eq. 
(3.17), if we have a mixture of hot and cold dark matter, 4> — > (4> c + <t> v ) ; 
additional contributions may be added as appropriate. Equation ( 3.22| ) 
becomes 



S„(k, s) = j J ds' a 2 {s') [qF(q)} ATiGa 2 {s') 

x p c (s')S c (k,s') + p u ( s %(k, s ')\ . (3.33) 

This equation was first derived (in a slightly different form) by Gilbert 
(1966) and is known as the Gilbert equation. Note that in the self- 
gravitating case 5 U appears both inside and outside an integral. Equation 
(3.33) is a Volterra integral equation of the second kind. Bcrtschinger & 
Watts (1988) present a numerical quadrature solution method. 

Using the same trick as in the previous subsection, we can convert the 
Gilbert equation to a differential equation for 8 V , if the unperturbed phase 



space density distribution is approximated by the form / 7 (p) of eq. (3.26). 
The result is 

5 V + ^ + 2kc^j K + k 2 c 2 v b v = AnGa 2 [p c S c + p v S v ] . (3.34) 



With a suitable choice for the parameter 7, the solution of eq. ( 3.34 ) 
provides a good match (to within a few percent, in general) to the solu- 
tion of the Gilbert equation using the correct Fermi-Dirac distribution for 
massive neutrinos (Setayeshgar 1990). Therefore, it may be used for ob- 
taining quick estimates of the density perturbations of nonrelativistic hot 
dark matter. 
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4. Relativistic cosmological perturbation theory 

4-1. Introduction 

This section is an expanded version of my fifth lecture at Les Houches. 
One lecture gave barely enough time to introduce the essential ideas of 
relativistic perturbation theory: classification of metric perturbations, the 
linearized Einstein equations, and gauge modes. Understanding the physics 
of these topics, as well as the relativistic generalizations of my previous 
lectures, requires a much deeper immersion. Unable to find a pedagogical 
treatment in the existing literature that matches these needs to my sat- 
isfaction, I have developed the subject more fully in these written lecture 
notes. They are not a complete guide to relativistic perturbation theory but 
rather a starting point from which the reader may delve into the increas- 
ingly rich literature of applications. This section is self-contained and may 
be read independently of the previous sections, although the reader may 
find it interesting to contrast the nonrelativistic presentations of sections 1 
and 2 with the relativistic treatment given below. 

4-1.1. Synopsis 

According to the Newtonian perspective of gravity and cosmology space- 
time is flat and absolute, gravity is action at a distance, and particle 
dynamics is given by Newton's second law F — ma or, equivalcntly, by 
Hamilton's principle of least action. The Einsteinian perspective is quite 
different: spacetime is a curved manifold which evolves causally through 
the Einstein field equations in response to sources, and particle dynamics 
is given in absence of nongravitational forces by geodesic motion. In this 
section I attempt not only to present the essentials of relativistic gravita- 
tional dynamics, but also to show how it reduces to and extends Newtonian 
cosmology in the appropriate limit. 

One of the main purposes of these notes is to provide a clear explanation 
of the scalar, vector, and tensor modes of gravitational perturbations. (We 
shall follow the customary usage in this subject by referring to different spa- 
tial symmetry components as "modes" even when they are not expanded 
in any basis eigenfunctions. Thus, the "scalar mode" is described, in part, 
by a field 4>{x^) that is a scalar under spatial coordinate transformations 
but is not restricted to being a single Fourier component or other harmonic 
basis function.) Newtonian gravity corresponds to the former (the scalar 
mode), while the latter (vector and tensor modes) represent the relativis- 
tic effects of gravitomagnetism and gravitational radiation, which have no 
counterpart in Newtonian gravity although they are similar to electromag- 
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netic phenomena. If the motion of sources is expanded in powers of v/c, 
the vector and tensor gravitational fields are 0(v/c) and 0(v/c) 2 times the 
Newtonian field, respectively. On terrestrial scales the vector and tensor 
modes are extremely weak — they have not been detected in the laboratory, 
although satellite experiments are planned to search for the former through 
the Lense-Thirring "gravitomagnetic moment" precession, and large inter- 
ferometric detectors are being built to measure gravitational radiation — 
but they could have important consequences for the evolution of large-scale 
matter and radiation fluctuations, including the production of anisotropy 
in the microwave background radiation. 

The Newtonian limit corresponds to weak gravitational fields (black holes 
are to be avoided) and slow motions (v 2 <C c 2 , for both sources and test 
particles). For nearly all cosmological applications it is sufficient to consider 
only weak fields — small perturbations of the spacetime metric around a 
homogeneous and isotropic background spacetime. At the same time it is 
usually safe to assume that the gravitational sources are nonrelativistic, 
although the test particles (e.g., photons) need not be. Because the weak- 
field, slow source motion limit does not necessarily imply small density 
fluctuations, we can (and will) investigate nonlinear particle and fluid dy- 
namics even while treating the metric perturbations and source velocities 
as being small. 

In sections 4.2-4.5 we shall develop the machinery for cosmological 
perturbation theory using the methods developed by Lifshitz, Peebles, 
Bardeen, Kodama & Sasaki, and others. We discuss the consequences of 
gauge invariance — the invariance of physical quantities to small changes 
in the spacetime coordinates — and summarize the standard results in the 
synchronous gauge of Lifshitz (1946) n In section 4.6 we introduce a new 
gauge that clarifies how general relativity extends Newtonian gravity in 
the weak-field limit and in section 4.7 we attempt to clarify the physical 
content of general relativity theory in this limit. In section 4.8 we shall see 
how simply and clearly the Hamiltonian formulation of particle dynamics 
follows from general relativity. Finally, in section 4.9 we introduce an alter- 
native fully nonlinear formulation of general relativity due to Ehlers, Ellis 
and others, and we demonstrate its connection with the Lagrangian fluid 
dynamics that was discussed in my fourth lecture. 



* Apparently it is not widely known that Lifshitz' paper is published in English and is 
available in many libraries. This classic paper was remarkably complete, including a full 
treatment of the scalar, vector, and tensor decomposition in open and closed universes 
and a concise solution to the gauge mode problem; it presented solutions for perfect fluids 
in matter- and radiation-dominated universes; and it contrasted isentropic (adiabatic) 
and entropy fluctuations. 
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We shall not discuss the relativistic Boltzmann equation nor the classifi- 
cation of isentropic and isocurvature initial conditions. In the nonrelativis- 
tic limit, these topics have already been covered in my preceding lectures. 
Neither shall we discuss the physics of microwave background anisotropy 
or the evolution of perturbations in specific models. Our aim here is to de- 
rive and comprehend the gravitational field equations, not their solution. 
Although this goal is restricted, we shall see that the physical content is 
sufficiently rich. After working through these notes the reader may wish to 
consult one of the many books or articles discussing the detailed evolution 
for a variety of models (e.g., Lifshitz & Khalatnikov 1963; Peebles & Yu 
1970; Weinberg 1972; Peebles 1980; Press & Vishniac 1980; Wilson & Silk 
1981; Wilson 1983; Bond & Szalay 1983; Zel'dovich & Novikov 1983; Ko- 
dama & Sasaki 1984, 1986; Efstathiou & Bond 1986; Bond & Efstathiou 
1987; Ratra 1988; Holtzman 1989; Efstathiou 1990; Mukhanov, Fcldman & 
Brandenberger 1992; Liddle & Lyth 1993; Peebles 1993; Ma & Bertschinger 
1994b). 

Understanding these notes will not require much experience with general 
relativity, although some background is helpful. The reader can test the 
waters by examining the following summary of essential general relativity 
and differential geometry. While some mathematical formalism is needed to 
get started, the focus thereafter will remain as much as possible on physics. 

4-1-2. Summary of essential relativity 

We adopt the following conventions and notations, similar to those of Mis- 
ncr, Thorne & Wheeler (1973). Units are chosen so that c = 1. The 
metric signature is ( — ,+,+,+). The unperturbed background spacetime 
is Robertson- Walker with scale factor a(r) expressed in terms of conformal 
time. A dot (or 3 T ) indicates a conformal time derivative. The comoving 
expansion rate is written t](t) = a/ a = aH. The scale factor obeys the 
Friedmann equation, 

V 2 = f Ga 2 p-K. (4.1) 

The Robertson- Walker line element is written in the general form using 
conformal time r and comoving coordinates x l : 

ds 2 = g^dx^dx" = a 2 (r) [-dr 2 + j l3 (x k )dx l dx j ] . (4.2) 

Latin indices (i, j, k, etc.) indicate spatial components while Greek in- 
dices (p, v, A, etc.) indicate all four spacetime components; we assume a 
coordinate basis for tensors. Summation is implied by repeated upper and 
lower indices. The inverse 4-metric g^ v (such that g^ v g VK — 5^ K ) is used 
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to raise spacetime indices while the inverse 3-metric 7 y (7 y 7j/c = S l k ) is 
used to raise indices of 3-vectors and tensors. Three-tensors are defined in 
the spatial hypersurfaces of constant r with metric 7^ and they shall be 
clearly distinguished from the spatial components of 4-tensors. We shall 
see as we go along how this "3+1 splitting" of spacetime works when there 
are metric perturbations. 

Many different spatial coordinate systems may be used to cover a 
uniform-curvature 3-space. For example, there exist quasi-Cartesian co- 
ordinates (x,y, z) in terms of which the 3-metric components are 

(4.3) 

We shall use 3-tensor notation to avoid restricting ourselves to any partic- 
ular spatial coordinate system. Three-scalars, vectors, and tensors are in- 
variant under transformations of the spatial coordinate system in the back- 
ground spacetime (e.g., rotations). A 3-vector may be written A = A % e,i 
where is a basis 3-vector obeying the dot product rule • e,- = 7^ . A 
second-rank 3-tensor may be written (using dyadic notation and the tensor 
product) h = h^ei ® ej. We write the spatial gradient 3-vector operator 
V = e l di (3j = d/dx l ) where e l ■ ej — S l j. The experts will recognize e l as 
a basis one- form but we can treat it as a 3-vector e % = 7*- 7 'e J - because of the 
isomorphism between vectors and one-forms. Because the basis 3-vectors 
in general have nonvanishing gradients, we define the covariant derivative 
(3-gradient) operator Vj with 'Vijjk = 0. If the space is flat (K = 0) and 
we use Cartesian coordinates, then 7^ = 5ij, Vj = 3,, and the 3-tensor 
index notation reduces to elementary Cartesian notation. If K 7^ 0, the 
3-tensor equations will continue to look like those in flat space (that is 
why we use a 3+1 splitting of spacetime!) except that occasionally terms 
proportional to K will appear in our equations. 

Our application is not restricted to a flat Robertson- Walker background 
but allows for nonzero spatial curvature. This complicates matters for two 
reasons. First, we cannot assume Cartesian coordinates. As a result, for 
example, the Laplacian of a scalar and the divergence and curl of a 3-vector 
involve the determinant of the spatial metric, 7 = det{7ij}: 

v 2 «^7- 1/2 M7 v W) , v • v = 7 - 1 /2 9i (y/v) , 

V xv = e^ k (d iVj )e k , (4.4) 

where e yfe = 7 -1 / 2 [ijk] is the three-dimensional Levi-Civita tensor, with 
[ijk] = +1 if {ijk} is an even permutation of {123}, [ijk] = —1 for an 
odd permutation, and if any two indices are equal. The factor 7 -1 / 2 



= 5,. 



1 + ^ (x 2 + y 2 + z 2 ) 
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ensures that t % ^ k transforms like a tensor; as an exercise one can show that 
eijk = 7 1/2 [ijk]- 

The second complication for K ^ is that gradients do not commute 
when applied to 3-vectors and 3-tensors (though they do commute for 3- 
scalars). The basic results are 

[V fc , V,] fr« = ^R\ kl h^ + ^R J nkl h™ , (4.5) 

where [Vj, Vfc] = (Vj Vfc — VfeVj). The commutator involves the spatial 
Riemann tensor, which for a uniform-curvature space with 3-metric 7^ is 
simply 

^W ]kl = K (5\ l3l - 5\ ljk ) . (4.6) 

Finally, we shall need the evolution equations for the full spacetime met- 
ric g^v These are given by the Einstein equations, 

G^ v = 8ttG T^y , (4.7) 

where T^ v is the stress-energy tensor and G^ v is the Einstein tensor, related 
to the spacetime Ricci tensor by 

Gfiu — R/mv ^"3m i/ ' ^ = R^fj, j Rpv = R \iki> ■ (4-8) 

The spacetime Riemann tensor is defined according to the convention 

n vkX — ° Ki vX ° Ai vk ' 1 otK uX 1 aX L vk > l^- y ^ 

where the affine connection coefficients are 

r"„ A = \g» K {d v g KX + d x g Kl/ - d K g„ x ) . (4.10) 
We see that the Einstein tensor involves second derivatives of the metric 



tensor components, so that eq. (4.7) provides second-order partial differ- 
ential equations for g^ v . 

The reader who is not completely comfortable with the material sum- 
marized above may wish to consult an introductory general relativity text- 
book, e.g. Schutz (1985). 

4-2. Classification of metric perturbations 

Now we consider small perturbations of the spacetime metric away from 
the Robertson- Walker form: 

ds 2 = a 2 (r) {-(1 + 2iP)dr 2 + 2w l drdx i + [(1 - 20) 7y - + 2/i y ] dxW} , 

yVhij = . (4.11) 
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We have introduced two 3-scalar fields ip(x,r) and </>(x,t), one 3- vector 
field w(x,t) = Wie 1 , and one symmetric, traceless second-rank 3-tensor 
field h(x,r) = h^e 1 ® e? . No generality is lost by making traceless 
since any trace part can be put into (f>. The factors of 2 and signs have 
been chosen to simplify later expressions. 

Equation (4.11) is completely general: g^ has 10 independent compo- 
nents and we have introduced 10 independent fields (1 + 1 + 3 + 5 for 
ip + 4> + w + h). In fact, only 6 of these fields can represent physical de- 
grees of freedom because we are free to transform our 4 coordinates (t, x' 1 ) 
without changing any physical quantities. Infinitesimal coordinate trans- 
formations, called gauge transformations, result in changes of the fields 
(ip, <j), w, h) because the spacetime scalar ds 2 — g^dx^dx" must be in- 
variant under general coordinate transformations. We shall explore the 
consequences of this invariance later. Coordinate invariance complicates 
general relativity compared with other gauge theories (e.g., electromag- 
nctism) in which the spacetime coordinates are fixed while other variables 
change under the appropriate gauge transformations. 

Unless stated explicitly to the contrary, in the following we shall treat 
the perturbation variables (if), <fr,Wi, hij) exclusively as 3-tensors (of rank 
0, 1, or 2 according to the number of indices) with components raised and 
lowered using 7 y and jij. In doing this we choose to use 7^ as the 3- 
metric in the perturbed hypersurface of constant r despite the fact that 
the spatial part of the 4-metric (divided by a 2 ) is given by (1 — 2^)7^ + 
2hij . This treatment is satisfactory because we will assume that the metric 
perturbations are small and we will neglect all terms quadratic in them. 
However, we will use g^ to raise 4- vector components: G^ v — g^ K G KV . Do 
take care to distinguish Latin from Greek! 

We have introduced 3-scalar, 3-vector, and 3-tensor perturbations. 
(From now on we will drop the prefix 3- since it should be clear from 
the context whether 3- or 4- is implied.) Are these the famous scalar, vec- 
tor, and tensor metric perturbations? Not quite! Recall the decomposition 
of a vector into longitudinal and transverse parts: 

w = to 1 1 + w± , V x wu = V • wj_ = . (4-12) 

Since wu = -Vic for some scalar w, how can it be called a vector per- 
turbation? By definition, only the transverse component w± represents a 
vector perturbation. 

There is a similar decomposition theorem for tensor fields: Any diffcrcn- 
tiable traceless symmetric 3-tensor field hij(x) may be decomposed into a 
sum of parts, called longitudinal, solenoidal, and transverse: 

h(su) = h,| +h ± + h T . (4.13) 
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The various parts are defined in terms of a scalar field h[x) and transverse 
(or solenoidal) vector field h(x) such that 

h {j , || = Dijh , hij, j_ = V^hj) , Vih l o T = , (4.14) 

where we have denoted symmetrization with parentheses and have em- 
ployed the traceless symmetric double gradient operator: 

V (i /ij) = - (Vihj + Vjhi) , Da = ViVj - - 7i3 ■ V 2 . (4.15) 

Note that the divergences of h|| and are longitudinal and transverse 
vectors, respectively (it doesn't matter which index is contracted on the 
divergence since h is symmetric): 

V-h[, = | V(V 2 + 3K)h , V-h ± = \{V 2 + 2K)h , (4.16) 

where V 2 /i = (V 2 /i')e.j. (We do not call the transverse part, as we 
would by extension from w± , because "transverse" is conventionally used 
to refer to the tensor part.) The longitudinal tensor hp is also called the 
scalar part of h, the solenoidal part is also called the vector part, and 
the transverse-traceless part hx is also called the tensor part. This clas- 
sification of the spatial metric perturbations hij was first performed by 
Lifshitz (1946). 

The purpose of this decomposition is to separate hij into parts that 
can be obtained from scalars, vectors, and tensors. Is the decomposition 
unique? Not quite. It is clear, first of all, that h and hi arc defined only 
up to a constant. But there may be additional freedom (Stewart 1990). 

First, the vector h is defined only up to solutions of Killing's equation 
'Vihj + Vj/ij = 0, called Killing vectors (Misner et al. 1973). The reader 
can easily verify that one such solution (using the quasi-Cartesian coordi- 
nates of eq. fO| ) is (h x , h y , h z ) — (y,—x,0). In an open space [K ^ 0) 
this solution would be excluded because it is unbounded — our perturba- 
tions should not diverge! — but in a closed space (K > 0) the coordinates 
have a bounded range. This Killing vector, and its obvious cousins, cor- 
respond to global rotations of the spatial coordinates and not to physical 
perturbations. 

Next, there may also be non-uniqueness associated with the tensor (and 
scalar) component: 

Kj, t -> h^ T + &j , Qj = [Vi Vj - 7 y (V 2 + 2K)] ( , (4.17) 

where C is some scalar field. From eqs. (4.5) and ( |4.6| ) one can show 
V 2 (VjC) = V^V 2 + 2K)( so that ViC'j = as required for the tensor 
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component. However, we also require hij t i to be traceless, implying (V 2 + 
3K)( = 0. Thus, the tensor mode is defined only up to eq. (4.17) with 
bounded solutions of (V 2 + 3K)( = 0. In fact, this condition also implies 
Qj = -Dy Ci so we may equally well attribute Qj to the scalar mode h^^ u. 
Thus, we are free to add any multiple of £ to h (the scalar mode) provided 
we subtract -D^ C from the tensor mode. In an open space (K 0) there are 
no nontrivial bounded solutions to (V 2 + 3K )£ = but in a closed space 
(K > 0) there are four linearly independent solutions (Stewart 1990). Once 
again, these solutions correspond to redefinitions of the coordinates with no 
physical significance. Kodama & Sasaki (1984, Appendix B) gave a proof 
of the tensor decomposition theorem, but they missed the additional vector 
and scalar/tensor mode solutions present in a closed space. In practice, it 
is easy to exclude these modes, and so we shall ignore them hereafter. 

Thus, we conclude that the most general perturbations of the Robertson- 
Walker metric may be decomposed at each point in space into four scalar 
parts each having 1 degree of freedom (i/>, </>, Wu , hii), two vector parts each 
having 2 degrees of freedom (w±, hjj, and one tensor part having 2 de- 
grees of freedom (h-r, which lost 3 degrees of freedom to the transversality 
condition). The total number of degrees of freedom is 10. 

Why do we bother with this mathematical classification? First and fore- 
most, the different metric components represent distinct physical phenom- 
ena. (By way of comparison, in previous lectures we have already seen 
that and v± play very different roles in fluid motion.) Ordinary New- 
tonian gravity obviously is a scalar phenomenon (the Newtonian potential 
is a 3-scalar), while gravitomagnetism and gravitational radiation — both 
of which are absent from Newton's laws, and will be discussed below - 
are vector and tensor phenomena, respectively. Moreover, this spatial de- 
composition can also be applied to the Einstein and stress-energy tensors, 
allowing us to see clearly (at least in some coordinate systems) the physical 
sources for each type of gravity. Finally, the classification will help us to 
eliminate unphysical gauge degrees of freedom. There are at least four of 
them, corresponding to two of the scalar fields and one transverse vector 
field. 

We will not write the weak-field Einstein equations for the general metric 
of eq. (4.11). Instead, we will consider only two particular gauge choices, 
each of which allows for all physical degrees of freedom (and more, in the 
case of synchronous gauge). First, however, we must examine the stress- 
energy tensor. 
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4-3. Stress-energy tensor 



The Einstein field eqs. (4.7) show that the stress-energy tensor provides 
the source for the metric variables. For a perfect fluid the stress-energy 
tensor takes the well-known form 

T» u = {p + p)u<*u v + pg^ , (4.18) 

where p and p are the proper energy density and pressure in the fluid rest 
frame and u M — dx^/dX (where dX 2 = —ds 2 ) is the fluid 4- velocity. In 
any locally flat coordinate system, T 00 represents the energy density, T° l 
the energy flux density (which equals the momentum density T l °), and T l] 
represents the spatial stress tensor. In locally flat coordinates in the fluid 
frame, T 00 = p, T oi = 0, and T« = p5*i for a perfect fluid. 

For an imperfect fluid such as a sum of several uncoupled components 
(e.g., photons, neutrinos, baryons, and cold dark matter), the stress-energy 
tensor must include extra terms corresponding in a weakly collisional gas to 
shear and bulk viscosity, thermal conduction, and other physical processes. 
We may write the general form as 

= (p + p)u>*u v + pg 11 " + . (4.19) 

Without loss of generality we can require £ M1/ to be traceless and flow- 
orthogonal: S M M = 0, Y^ v u v = 0. In locally flat coordinates in the fluid 
rest frame only the spatial components Y? 3 are nonzero (but their trace 
vanishes) and the spatial stress is T lJ — p8 13 + S y . With these restrictions 
on (in particular, the absence of a S° l term in the fluid rest frame) 
we implicitly define w M so that pu 11 is the energy current 4-vector (as op- 
posed, for example, to the particle mass times the number current 4-vector 
for the baryons or other conserved particles). As a result of these condi- 
tions, pu^ includes any heat conduction, p includes any bulk viscosity (the 
isotropic stress generated when an imperfect fluid is rapidly compressed or 
expanded) , and (called the shear stress) includes shear viscosity. Some 



workers add to eq. (4.1£) terms proportional to the 4- velocity, q^u v +u^q v , 
where q* 1 is the energy current in the particle frame (taking u M to be pro- 
portional to the particle number current). Either choice is fully general, 
although our choice is the simplest. 

We shall need to evaluate the stress-energy components in the comoving 
coordinate frame implied by eq. (4.11). This requires specifying the form 
of the 4- velocity u M . Therefore we must digress to discuss the 4- velocity 
components in a perturbed spacetimc. 

Consider first the case where the fluid is at rest in the comoving frame, 
i.e., u l — 0. (This condition defines the comoving frame.) Normalization 
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{g^ v u tl u u = — 1) then requires u° = a (1 — ip) to first order in ip. Lowering 
the components using the full 4-metric gives uo = —a(l + ip) and Ui = awi 
in the weak-field approximation. 

The appearance of ip and Wi in the components for a fluid at rest in the 
comoving frame may appear odd. They arise because, in our coordinates, 
clocks run at different rates in different places if Vji/> 7^ (the coordi- 
nate time interval dr corresponds to a proper time interval o(t)(1 + ip)dr) 
and they also have a position-dependent offset if Wi ^ (an observer at 
x l = constant sees the clocks at x l + dx l running fast by an amount Widx 1 ). 
At first these may seem like strange coordinate artifacts one should avoid 
(this may be a motivation for the synchronous gauge in which ip = Wi = 0!) 
but they have straightforward physical interpretations: ip represents the 
gravitational redshift and Wi represents the dragging of inertial frames. We 
shall see later that they also can be interpreted as giving rise to "forces," 
allowing us to apply Newtonian intuition in general relativity. Do not for- 
get that in general relativity we are forced to accept coordinates whose 
relation to proper times and distances is complicated by spacetime curva- 
ture. Therefore, it is advantageous when we can reinterpret these effects in 
Newtonian terms. 

We define the coordinate 3-velocity 

dx dx l u 1 

whose components are to be raised and lowered using 7 y and 7^ : Vi — 
Jijvi — -fijiii /u°, v 2 = -fijV l vi , w ■ v = WiV 1 , v ■ h ■ v = h i jV l v : > , etc. The 
4-vector component u° follows from applying the normalization condition 
= -1: 



u° 



a\J\ — v 2 



ip — w ■ v + 4>ir — v ■ h • v 
1 1~ ~ 2 



(4.21) 



In the absence of metric perturbations this looks like the standard result 
in special relativity aside from the factor a -1 that appears because we 
use comoving coordinates. With metric perturbations we can no longer 
interpret v exactly as the proper 3-velocity because adx 1 is not proper 
distance and adr is not proper time. However, the corrections are only 
first order in the metric perturbations. 

We will assume that the mean fluid velocity is nonrelativistic so that 
we can neglect all terms that are quadratic in v. (This does not exclude 
the radiation era, since we allow individual particles to be relativistic and 
require only the bulk velocity to be nonrelativistic.) We will also neglect 
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terms involving products of v and the metric perturbations. With these 
approximations, the 4-velocity components become 

u° = a _1 (l--0) , it* = a~V , uo = -a(l+V0 , u i = a(vi+Wi) .(4.22) 

The apparent lack of symmetry in the spatial components arises because 



U; 



gioU + gjjVp and gm — a Wi ^ in general. 



From eq. (4.22) we can see how Wi is interpreted as a frame-dragging 
effect. For w% ^ the worldline of a comoving observer (defined by the 
condition Vi — 0) is not normal to the hypersurfaces r = constant: — 
awi£, 1 ^ for a 3-vector £\ In a locally inertial frame, on the other hand, 
the worldline of a freely-falling observer obviously would be normal to the 
spatial directions. (This is true in special relativity and also in general 
relativity as a consequence of the equivalence principle.) By making a 
local Galilean transformation, dx l — > dx l + w l dr, we can remove Wi from 
the metric at a point. This transformation corresponds to choosing a locally 
inertial frame, called the normal frame, moving with 3- velocity —w relative 
to the comoving frame. In the normal frame the fluid 3-velocity is v + w. 

If Wi = Wi(r) is independent of x, one can remove Wi everywhere from 
the metric by a global Galilean transformation. (Try it and see!) However, 
we may be interested in situations where Wi = Wi(x,r) so that different 
transformations are required in different places. In this case there is no 
global inertial frame. Spatially varying iUj corresponds to shearing and/or 
rotation of the comoving frame relative to the normal frame. This is called 
the "dragging of inertial frames." Although we can choose coordinates in 
which Wi = everywhere, we shall see that there are advantages in not 
hiding the dragging of inertial frames. In general, the comoving frame 
is noninertial: an observer can remain at fixed x l only if accelerated by 
nongravitational forces. The synchronous gauge is an exception in that 
Wi = everywhere and the comoving frame is locally inertial. We shall see 
later that these features of synchronous gauge obscure rather than eliminate 
the physical dragging of inertial frames. 

Now that we have all the ingredients we can finally write the stress- 
energy tensor components in our perturbed comoving coordinate system in 
terms of physical quantities: 

T° = - P , t\ =-( p + p y , 

T° 4 = (p + P ) (vi + Wi ) , T- = P 6 i j + E*^ . (4.23) 

We use mixed components in order to avoid extraneous factors of a(l + ijj) 
and a(\ — 4>). N ote th at th e traceless shear stress £' • may be decomposed 



as in eqs. ( 4.13 ) and ( 4.14 ) into scalar, vector, and tensor parts. Similarly, 
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the energy flux density (p + p)v l may be decomposed into scalar and vector 
parts. (The pressure appears here, just as in special relativity, to account 
for the pdV work done in compressing a fluid. For a nonrelativistic fluid p <C 
p, but we shall not make this restriction.) We may already anticipate that 
these sources are responsible in the Einstein equations for scalar, vector, 
and tensor metric perturbations. 

In writing the components of the stress-energy tensor we have not as- 
sumed \5p\ -C p. The only approximations we make in the stress-energy 
tensor are to neglect (relative to unity) v 2 and all terms involving products 
of the metric perturbations with v and E^ . Of course, owing to the weak- 
field approximation, we are also neglecting any terms that are quadratic in 
the metric perturbations themselves. 

Before moving on to discuss the Einstein equations we should rewrite 
the conservation of energy-momentum, V^T^ = 0, in terms of our metric 
perturbation and fluid variables. (We use V M to denote the full spacetime 
covariant derivative relative to the 4- metric g^ u . It should not be confused 
with the spatial gradient V, defined relative to the 3-metric 7y.) Using 
the approximations mentioned in the preceding paragraph, one finds 

d T p + 3(?/ - 4>) (p + p) + V • [(p + p) v ] = (4.24) 

and 

9t [(p + p)(v + w)] +4ri(p + p)(v + w) 

+Vp + V -T + (p + p)VtP = . (4.25) 

(Deriving these gives useful practice in tensor algebra.) It is easy to in- 
terpret the various terms in these equations. The terms proportional to 
the expansion rate r\ arise because we are using comoving coordinates and 
conformal time and have not factored out a -3 from p or p. The pressure 
p is present with p because we let p be the energy density (not the rest- 
mass density), which is affected by the work pressure does in c omp ressing 



the fluid. Excluding these terms, the energy-conservation eq. ( 4.24 ) looks 
exactly like the Newtonian continuity equation aside from the change in 
the expansion rate from rj to r\ — (p. This modification is easily understood 
by noting from eq. (4.11) that the effective isotropic expansion factor 
is modified by spatial curvature perturbations to become a(l — cf>). The 
momentum-conservation eq. (4.25) similarly looks like the Newtonian ver- 
sion with a gravitational potential ?/>, aside from the special-relativistic 
effects of pressure and the addition of w to all the velocities to place them 
in the normal (inertial) frame. 
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4-4- Synchronous gauge 

Synchronous gauge, introduced by Lifshitz (1946) in his pioneering calcu- 
lations of cosmological perturbation theory, is defined by the conditions 
ip = Wi — 0, which eliminate two scalar fields (ip an d the longitudinal part 
of w) and one transverse vector field {w±). It is not difficult to show that 
synchronous coordinates can be found for any weakly-perturbed spacetime. 
However, the synchronous gauge conditions do not eliminate all gauge free- 
dom. This has in the past led to considerable confusion (for discussion see 
Press & Vishniac 1980 and Bardeen 1980). 

Synchronous gauge has the property that there exists a set of comoving 
observers who fall freely without changing their spatial coordinates. (This 
is nontrivial when one notes that in order to remain at a fixed terrestrial 
latitude, longitude, and altitude above the surface of the earth it is nec- 
essary to accelerate everywhere except in geosynchronous orbits.) These 
observers are called "fundamental" comoving observers. The existence of 
fundamental observers follows from the geodesic equation 

^+ r V«^0 (4-26) 

for the trajectory ^(A), where dX = ( — cLs 2 ) 1 / 2 for a timelike geodesic and 
= dx^/dX. With ip = Wi = 0, eq. ( 4.10 ) gives r l 00 = 0, implying that 
u l — is a geodesic. 

Each fundamental observer carries a clock reading conformal time r = 
J dt/a(t) and a fixed spatial coordinate label x % . The clocks and labels of 
the fundamental observers are taken to define the coordinate values at all 
spacetime points (assuming that these hypothetical observers densely fill 
space). The residual gauge freedom in synchronous gauge arises from the 
freedom to adjust the initial settings of the clocks and the initial coordinate 
labels of the fundamental observers. 

Because the spatial coordinates x % of each fundamental observer are held 
fixed with time, the x % in synchronous gauge are Lagrangian coordinates. 
This implies that the coordinate lines become highly deformed when the 
density perturbations become large. When the trajectories of two funda- 
mental observers intersect the coordinates become singular: two different 
sets of x^ label the same spacetime event. This flaw of synchronous gauge 
is not apparent if \8p/p\ <C 1 and the initial coordinate labels are nearly 
unperturbed, so this gauge may be used successfully (with some care re- 
quired to avoid contamination of physical variables by the residual gauge 
freedom) in linear perturbation theory. 

To be consistent with the conventional notation used for synchronous 
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gauge (Lifshitz 1946; Lifshitz & Khalatnikov 1963; Weinberg 1972; Peebles 
1993) , in this section only we shall absorb cj> into hij and double hij : 

ds 2 = a 2 (r) [-dr 2 + + h ij )dx i dx j ) , h=h\^0. (4.27) 

Using this line element and the definitions of the Ricci and Einstein tensors, 
it is straightforward (if rather tedious) to derive the components of the 
perturbed Einstein tensor: 

-a 2 G° = 3(t7 2 + K) + v h - ~ (V 2 + 2K) h + iv<Vj-/» y , (4.28) 
a 2 G\ = \ (Vth - V,/V) , G\ = -Y J G°j , (4.29) 



31 



-a 2 G l 3 = (2t? + V 2 + K) 5 l 3 + Q3 2 + v d T - iv 2 ^) (hS l 3 - h" 

+ ^ (V fe V ; /i M ) ^ . (4.30) 

One can easily verify that the unperturbed parts of the Einstein equations 
G° = 8nGT° = -8irGp and G%- = SttGT^ = SirGpS^ give the Fried- 
mann and energy-conservation equations for the background Robertson- 
Walker spacetime. 

Our next goal is to separate the perturbed Einstein equations into scalar, 
vector, and tensor parts. First we must decompose the metric perturbation 
field hij as in eqs. ( 4.13 )-( 4.15| ), with a term added (and the notation 
changed slightly) to account for the trace of hij : 

hj = J hrfa + Dij ( V~ 2 £) + + hij, T , (4.31) 



where was defined in eq. ( pJ3|) . We require Vj/i z = Vj/l 1 ,,- T = to 



3, 

ensure that the last two parts of hij are purely solenoidal (vector mode) 
and transverse-traceless (tensor mode) contributions. The scalar mode 
variables are h and V -2 £, whose Laplacian is £. We shall not worry about 
how to invert the Laplacian on a curved space but simply assume that it 
can be done if necessary. 

The perturbed Einstein equations now separate into 7 different parts 
according to the spatial symmetry: 
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G\- \(V 2 + -iK)(t-K)+T 1 h = %TTGa 2 ( P -p) , (4.32) 

G °i,\\ : l^ l (h-0-KV l {v- 2 {)^8TTGa 2 [(p + P )v l } ll ,(4.33) 

G° it± : -^(V 2 + 2K)h l = 8TrGa 2 [{p + p)v l } ± , (4.34) 

G\: -(d 2 T + 2rjd T )h+^(V 2 + 3K)(h-0 

= 24irGa 2 {p-p) , (4.35) 

G l ^ || : (\t + r?3r) A, (V" 2 £) + - h) 



= SkGo 2 ^. I, , (4.36) 



& j>x : {^-dl + vdrj V {l h 3) = 8vrGa 2 E y , ± , (4.37) 

G l jt T : Qd* + t ? 3 t - i V 2 + ^ fty, T = 87rGo 2 E«, T . (4.38) 

The derivation of these equations is straightforward but tedious. They have 
decomposed naturally into separate equations for the scalar, vector, and 
tensor parts of the metric perturbation, with the sources for each given by 
the appropriate part of the energy- momentum tensor. However, there are 
more equations than unknowns! There are four scalar equations for £ and 
h, two vector equations for hi, and one tensor equation for Ziy, t- How can 
this be? 

Before answering this question, let us note another interesting feature 
of the equations above, which will provide a clue. The equations arising 
from G° „ involve only a single time derivative of the scalar and vector 
mode variables, while those arising from G l „ have two time derivatives, 
as we might have expected for equations of motion for the gravitational 
fields. This means that we could discard eqs. (4.32)-(4.34) and be left 
with exactly as many second-order in time equations as unknown fields. 
Alternatively, we could discard eqs. (4.35)— (4.37) and be left with exactly 
enough first-order in time equations for the scalar and vector modes. Only 
the tensor mode evolution is uniquely specified by a second-order wave 
equation. 

The reason for this redundancy is that the twice-contracted Bianchi iden- 



tities of differential geometry, V^G^^ = 0, force the Einstein eqs. (4.7) 
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to imply V^T^j, = 0. The Einstein equations themselves contain redun- 
dancy, as we can check explicitly here. By combining the time derivative 
of eq. (4.32) and the divergence of eqs. (4.33) and (4.34) one obtains 
the perturbed part of eq. (4.24) (note, however, that <f> — > —h/6). Simi- 
larly, eq. (4.25) follows from the time derivative of eqs. (4.33) and (4.34) 
combined with the gradient of eqs. (4.35)-(4.37). Because we require the 
equations of motion for the matter and radiation to locally conserve the net 
energy-momentum, three of the perturbed Einstein eqs. (4.32)-(4.38) are 
redundant. 

In the literature, G° = 8irGT° Q is often called the "ADM energy con- 
straint" and G° i = SttGT^ is called the "ADM momentum constraint" 
equation. The 3+1 space-time decomposition of the Einstein equations into 
constraint and evolution equations was developed in detail by Arnowitt, 
Deser & Misner (1962, ADM) and applied to cosmology by Durrer & 
Straumann (1988) and Bardeen (1989). The ADM constraint equations 
may be regarded as providing initial-value constraints on (h, £, h, £, hi) and 
the matter variables. If these constraints are satisfied initially (this is re- 
quired for a consistent metric), and if eqs. (4.35)-(4.37) are used to evolve 
(h,£,h,£,hi) while the matter variables are evolved so as to locally con- 
serve the net energy-momentum, then the ADM constraints will be fulfilled 
at all later times. (This follows from the results stated in the preceding 
paragraph.) In effect, the Einstein equations have built into themselves the 
requirement of energy-momentum conservation for the matter. If one were 
to integrate eqs. (4.35)-(4.37) correctly but to violate energy-momentum 
conservation, then eqs. (4.32)-(4.34) would be violated. 

In practice, we may find it preferable to regard the ADM constraints 
alone — and not eqs. (4.35)-(4.37) — as giving the actual field equa- 
tions for the scalar and vector metric perturbations. They have fewer time 
derivatives and hence are easier to integrate. Equations (4.35)-(4.37) are 
not necessary at all (although they may be useful for numerical checks) 
because they can always be obtained by differentiating eqs. (4.32)-(4.34) 
and using energy-momentum conservation. 

This situation becomes clearer if we compare it with Newtonian gravity. 
The field equation V 2 = AnGa 2 Sp is analogous to eq. (4.32). (We shall 
see this equivalence much more clearly in the Poisson gauge below.) Let us 
take the time derivative: V 2 = Ai:Gd T (a 2 Sp). If we now replace d T (6p) 
using the continuity equation, we obtain a time evolution equation for 
V 2 analogous to the divergence of eq. (4.33). The solutions to this 
evolution equation obey the Poisson equation if and only if the initial (f> 
obeys the Poisson equation. Why should one bother to integrate V 2 </> in 
time when the solution can always be obtained instantaneously from the 
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Poisson equation? Viewed in this way, we would say that the extra time 
derivatives in the G l „ equations have nothing to do with gravity per se. 
The real field equations for the scalar and vector modes come from the 
ADM constraint equations. 

If the scalar and vector metric perturbations evolve according to first- 
order in time equations, their solutions are not manifestly causal (e.g., 
retarded solutions of the wave equation). We shall discuss this point in 
detail in section 4.7. However, for now we may note that the tensor mode 
obeys the wave eq. (4.38). The solutions are the well-known gravity waves 
which, as we shall see, play a key role in enforcing causality. The source 
for these waves is given by the transverse-traceless stress (generated, for 
example, by two masses orbiting around each other). The rjd T term arises 
because we use comoving coordinates and the K term arises as a correction 
to the Laplacian in a curved space; otherwise the vacuum solutions are 
clearly waves propagating at the speed of light. Abbott & Harari (1986) 
show that eq. (4.38) is the Klein-Gordon equation for a massless spin-two 
particle. 



4-5. Gauge modes 



As we noted above, the synchronous gauge conditions do not completely 
fix the spacetime coordinates because of the freedom to redefine the per- 
turbed constant-time hypersurfaces and to reassign the spatial coordinates 
within these hypersurfaces. This freedom is not obvious in the linearized 
Einstein equations for the scalar and vector modes, but it is present in the 
form of additional solutions that must be fixed by appropriate choice of 
initial conditions and that represent nothing more than relabeling of the 
coordinates in an unperturbed Robertson- Walker spacetime. 

To see this effect more clearly, we consider a general infinitesimal coor- 
dinate transformation from (t, x l ) to (t,x 1 ), known as a gauge transfor- 
mation: 

f = t + a(x, t) , x % — x l + 7 y 'Vjj9(a;,T) + e l (x, t) , 

with V ■ e = . (4.39) 

For convenience we have split the spatial transformation into longitudinal 
and transverse parts. Note that the transformed time and space coordinates 
depend in general on all four of the old coordinates. 

Coordinate freedom leads to ambiguity in the meaning of density per- 
turbations. Consider, for example, the simple case of an unperturbed 
Robertson- Walker universe in which the density depends only on r (if one 
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uses the "correct" r coordinate) . In the transformed system it depends also 
on x l : p(f) = p(r) + (d T p)a(x, r). In other words, even in an unperturbed 
universe we can be fooled into thinking there are spatially-varying density 
perturbations. 

This example may seem contrived, but the ambiguity is not trivial to 
avoid: When spacetime itself is perturbed, and time is not absolute, what 
is the best choice of time? The same question arises for the spatial coordi- 
nates. 

To clarify this situation we must examine gauge transformations further. 
First note that when we transform the coordinates we must also transform 
the metric perturbation variables so that the line element ds 2 (a spacetime 
scalar) is invariant. It is straightforward to do this using eqs. (4.11) and 
(4.39). The result is 

1 

ip = ip — a — r/a , = 0+ - V p + r\a , 

Wi = vii + V 'i{a - 0) - e< , hij = h i:j - Dij/3 - V (i ej) , (4.40) 



where £>y is the traceless double gradient operator defined in eq. ( 4.15Q 



The transformed fields (with carets) are to be evaluated at the same coor- 
dinate values (r, x l ) as the original fields. 

Suppose now that our original coordinates satisfy the synchronous gauge 



conditions ip = Wi = 0. [To recover the notation of eq. ( 4.27 ) used specially 



for synchronous gauge we now double hij and put the trace of hij into 



h = —60.] From eqs. (4.40) and ( 4.27 ) it follows that there is a whole 



family of synchronous gauges with metric variables related to the original 
ones by 

h = h- 2V 2 /3 - 6r)P , | = £ - 2V 2 /3 , h z = h t - 2e, , (4.41) 
where 

P = fa(x) / -jt , ei = ei(x) . (4.42) 

Thus, the synchronous gauge has residual freedom in the form of one scalar 
and one transverse vector (ti) function of the spatial coordinates. 
The presence of these extraneous solutions (called gauge modes) has 
created a great deal of confusion in the past, which might have been avoided 
had more cosmologists read the paper of Lifshitz (1946). In 1980, Bardeen 
wrote an influential paper showing how one may take linear combinations of 
the metric and matter perturbation variables that are free of gauge modes. 
For example, Bardeen defined two scalar perturbations <$>a an d <&h related 
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to our synchronous gauge variables h and £ (Bardeen actually used the 
variables Hl = h/6 and Ht = — £/2) as follows: 

$ A = -iv- 2 (e + ? ?0 , ^ = ^(^-0-^V- 2 e. (4.43) 
It is easy to check that these variables are invariant under the synchronous 



gauge transformation given by eqs. ( 4.41 )-( 4.42) 



Bardeen's work led to a flurry of papers concerning gauge-invariant vari- 
ables in cosmology. A standard reference is the classic paper by Kodama 
& Sasaki (1984). Elegant treatments based on general 3+1 splitting of 
spacetime were given later by Durrer & Straumann (1988) and Bardeen 
(1989). The simpler form of the gauge-invariant variables often makes it 
easier to find analytical solutions (e.g., Rebhan 1992). However, it is not 
necessary to use gauge-invariant variables during a calculation, and many 
cosmologists have continued successfully to use synchronous gauge. In the 
end, when the results are converted to measurable quantities — spacetime 
scalars — the gauge modes automatically get canceled. In a numerical solu- 
tion, however, one must be careful that the gauge modes do not swamp the 
physical ones, otherwise roundoff can produce significant numerical errors. 

Gauge invariant variables actually appear somewhat strange if we con- 
sider the analogous situation in electromagnetism. The electric and mag- 
netic fields in flat spacetime may be obtained from potentials <j) and A 
(note we are implicitly using a 3+1 split of spacetime), 

E = - V0 - d T A , B = V x A . (4.44) 

With this choice, the source-free Maxwell equations are automatically sat- 
isfied; the other two (the Coulomb and Ampere laws) become 

V 2 + 3 r (V • A) = -4irp , (9 2 — V 2 ) A + = 4irJ , (4.45) 

where p is the charge density and J is the current density. These equations 
are invariant under the gauge transformation <f> = <f>— d T a, Ai = Ai + VjQ;. 

If we didn't know about electric and magnetic fields, but were alarmed 
by the gauge-dependence of the potentials, we could try to find linear com- 
binations of cp and A that are gauge-invariant. However, there are two 
well-known and more direct ways to eliminate gauge modes. The first is 
"gauge fixing" — i.e., placing constraints on the potentials so as to elim- 
inate gauge degrees of freedom. One popular choice, for example, is the 
Coulomb gauge V-A — 0, so that A = A± is transverse. The transversality 
condition means that the gauge transformation variable a cannot depend 
on position (though it can depend on time); thus, most of the gauge free- 
dom is eliminated. The second possibility is to work with the physical fields 
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themselves instead of the potentials: E and B are automatically gauge- 
invariant. This procedure requires that we analyze the equation of motion 
for charges to determine which combinations of <j) and A are physically 
most significant. 

In the next section we shall adopt the first procedure (gauge-fixing) using 
the gravitational analogue of the Coulomb gauge. Later we shall introduce 
Ellis' covariant approach based on gravitational fields themselves. 

4-6. Poisson gauge 

Recall that our general perturbed Robertson- Walker metric (4.11) contains 
four extraneous degrees of freedom associated with coordinate invariance. 
In the synchronous gauge these degrees of freedom are eliminated from g 0Q 
(one scalar) and goi (one scalar and one transverse vector) by requiring 
ip = Wi = 0. There are other ways to eliminate the same number of fields. 
As we shall see, a good choice is to constrain goi (eliminating one scalar) 
and gij (eliminating one scalar and one transverse vector) by imposing the 
following gauge conditions on eq. (4.11): 

V • w = , V • h = . (4.46) 

I call this choice the Poisson gauge by analogy with the Coulomb gauge 
of electromagnetism (V • A = 0) .111 More conditions are required here than 
in electromagnetism because gravity is a tensor rather than a vector gauge 
theory. Note that in the Poisson gauge there are two scalar potentials {ijj 
and 0), one transverse vector potential (w), and one transverse-traceless 
tensor potential h. 

A restricted version of the Poisson gauge, with iu, = hij = 0, is 
known in the literature as the longitudinal or conformal Newtonian gauge 
(Mukhanov, Feldman & Brandenberger 1992). These conditions can be 
applied only if the stress-energy tensor contains no vector or tensor parts 
and there are no free gravitational waves, so that only the scalar metric 
perturbations are present. While this condition may apply, in principle, 
in the linear regime Q8p/p\ <C 1), nonlinear density fluctuations generally 
induce vector and tensor modes even if none were present initially. Setting 
w = h = is analogous to zeroing the electromagnetic vector potential, 

t The same gauge has been proposed recently by Bombelli, Couch & Torrence (1994), 
who call it "cosmological gauge." However, I prefer the name Poisson gauge because 
cosmology — i.e., nonzero a — is irrelevant for the definition and physical interpreta- 
tion of this gauge. Although I have seen no earlier discussion of Poisson gauge in the 
literature, its time slicing corresponds with the minimal shear hypersurface condition of 
Bardeen (1980). 
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implying B — 0. In general, this is not a valid gauge condition — it is 
rather the elimination of physical phenomena. The longitudinal/conformal 
Newtonian gauge really should be called a "restricted gauge." The Poisson 
gauge, by contrast, allows all physical degrees of freedom present in the 
metric. 

To prove the last statement, and to find out how much residual gauge 
freedom is allowed, we must find a coordinate transformation from an ar- 
bitrary gauge to the Poisson gauge. Using eq. (4.40) with hats indicating 
Poisson gauge variables, we see that a suitable transformation exists with 

a = w + h , [3 = h , £j = hi , (4.47) 

where w comes from the longitudinal part of w (wu = -Vw). while h 
and hi come from the longitudinal and solenoidal parts of h in eq. (4.14). 
Because these conditions are algebraic in a, (3, and e (they are not differ- 
entiated, in contrast with the transformation to synchronous gauge of eq. 
4.41 ), we have found an almost unique transformation from an arbitrary 
gauge to the Poisson gauge. One can still add arbitrary functions of time 
alone (with no dependence on x l ) to a and tj. (Adding a function of time 
alone to (3 has no effect at all because the transformation, eq. 4.39, involves 
only the gradient of /?.) 

Spatially homogeneous changes in a represent changes in the units of 
time and length, while spatially homogeneous changes in e represent shifts 
in the origin of the spatial coordinate system. These trivial residual gauge 
freedoms — akin to electromagnetic gauge transformations generated by 
a function of time, the only gauge freedom remaining in Coulomb gauge 
— are physically transparent and should cause no conceptual or practical 
difficulty. 

It is interesting to see the coordinate transformation from a synchronous 
gauge to the Poisson gauge. As an exercise the reader can show that this 
is given by 

^ = -~V- 2 (£f»tf), 4=Uz-h) + ±-nV- 2 £, Wi = -h T hi.(4A8) 
2 b I I 

Comparing with eq. ( 4.43 ), we see that the two Poisson-gauge scalar po- 
tentials are ip = $a and <f> = — (Kodama & Sasaki 1984 call these 
variables \& = ip and $ = —<j)-) The vector potential Wi in Poisson gauge is 
relat ed simply to the solenoidal potential hi of the synchronous gauge (eq. 
Ol|) . 

Thus, the metric perturbations in the Poisson gauge correspond exactly 
with several of the gauge-invariant variables introduced by Bardecn. By 
imposing the explicit gauge conditions (4.46), we have simplified the math- 
ematical analysis of these variables. 
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Now that we have seen that the Poisson gauge solves the gauge-fixing 
problem, let us give the components of the perturbed Einstein equations. 
They are no more complicated than those of the synchronous gauge: 

(V 2 + 3iT) - 377 (4, + rjip] = ATrGa 2 (p - p) , (4.49) 

-Vi(0 + T#) = 4nGa 2 [(p + p){vi+w i )} ll , (4.50) 
(V 2 + 2K)w l = 16nGa 2 [(p + p)(v t +w i )} ± , (4.51) 

<j> - K<t> + T)(if> + 24>) + (2r? + f] 2 )t/j - - V 2 (4> - VO 

= 4nGa 2 {p-p) , (4.52) 

D i:j {(j)-^) = 87rGa 2 S y - „ , (4.53) 

- (9 r + 2rf) V(iWj) = 87rGa 2 S„- ± , (4.54) 

(3 2 + 2?7d T - V 2 + 2K) h i:j = 8vrGa 2 S li: T . (4.55) 

As in the synchronous gauge, the scalar and vector modes satisfy initial- 
value (ADM) constraints (eqs. 4.49-4.51) in addition to evolution equa- 
tions. However, it is remarkable that in the Poisson gauge we can obtain the 
scalar and vector potentials directly from the instantaneous stress-energy 
distribution with no time integration required. This is clear for </> — ?/' an d 
w, both of which obey elliptic equations with no time derivatives (eqs. 4.53 
and 4.51, respectively). By combining the ADM energy and longitudinal 
momentum constraint equations we can also get an instantaneous equation 
for </>: 

(V 2 + 3K) 4 ^ AttGo 2 [6p + 3r]^ f ] , -V$/ = [{p + p)(v + w)] l{ . 

(4.56) 

Bardeen (1980) defined the matter perturbation variable e m = (5p + 
377$/ )/p and noted that it is the natural measure of the energy density 
fluctuation in the normal (inertial) frame a t res t with the matter such that 
v + w = (recall the discussion in section |4~3| ) . However, for our analysis 
we will remain in the comoving frame of the Poisson gauge, in which case 
5p/p and not e m is the density fluctuation. 

We can show that for nonrelativistic matter the field equations we have 
obtained reduce to the Newtonian forms. First, it is clear that in the non- 
cosmological limit (77 = K = 0), eq. (4.56) reduces to the Poisson equation. 
For 77 ^ the longitudinal momentum density $/ is also a source for <f>, but 
it is unimportant for perturbations with \8p/p\ 3> vhv/c 2 where vh is the 
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Hubble velocity across the perturbation. Next, consider the implications 
of the fact that the shear stress for any physical system is at most 0(pc 2 ) 
where c s is the characteristic thermal speed of the gas particles. (For a 
collisional gas the shear stress is much less than this.) Equation (4.53) 
then implies that the relative difference between tp and <f> is no more than 
0(c s /c) 2 . Third, eq. (4.51) implies that the vector potential w <~ (vh/c) 2 v. 
Thus, the deviations from the Newtonian results are all 0(v/c) 2 . Poisson 
gauge gives the relativistic cosmological generalization of Newtonian gravity. 

There are still more remarkable features of the Poisson gauge. First, 
the Poisson gauge metric perturbation variables are almost always small 
in the nonrelativistic limit (|<^| <C c 2 , v 2 <§; c 2 ), in contrast with the syn- 
chronous gauge variables hij, which become large when \Sp/p\ > 1. (How- 
ever, Bardeen 1980 shows that the relative numerical merits of these two 
gauges can reverse for isocurvature perturbations of size larger than the 
Hubble distance.) Second, if (t(),<j),w,h) are very small, they — but not 
necessarily their derivatives! — may be neglected to a good approximation, 
in which case the Poisson gauge coordinates reduce precisely to the Eule- 
rian coordinates used in Newtonian cosmology. Finally, it is amazing that 
the scalar and vector potentials depend solely on the instantaneous distri- 
bution of stress-energy — in fact, only the energy and momentum densities 
and the shear stress are required. Only the tensor mode — gravitational ra- 
diation — follows unambiguously from a time evolution equation. In fact, 
it obeys precisely the same equation as in the synchronous gauge (with 
a factor of 2 difference owing to our different definitions) because tensor 
perturbations are gauge-invariant — coordinate transformations involving 
3-scalars and a 3- vector cannot change a 3-tensor (leaving aside the special 
case of eq. 4.17 for a closed space). 



4-7. Physical content of the Einstein equations 

In the last section we showed that the Poisson gauge variables (ijj, <f>,w) 
are given by the instantaneous distributions of energy density, momentum 
density, and shear stress (longitudinal momentum flux density). Is this 
action at a distance i n gen eral relativity? 



We showed in eq. ( 4.47 ) that the Poisson gauge can be transformed to 
any other gauge. In the cosmological Lorentz gauge (see Misner et al. 1973 
for the noncosmological version) all metric perturbation components obey 
wave equations. Therefore, the solutions in Poisson gauge must be causal 
despite appearances to the contrary. 

There is a precedent for this type of behavior: the Coulomb gauge of 



Cosmological Dynamics 



59 



electromagnetism. With V • A — 0, eqs. ( 4.45| ) become 



V 2 <j) = -4np , = 4ttJ|| , (S 2 . — V 2 ) A = 4ttJ ± . (4.57) 

We have separated the current density into longitudinal and transverse 
parts. The similarity of the first two (scalar) equations to eqs. (4.49) 
and (4.50) is striking. The similarity would be even more striking if we 
were to use comoving coordinates rather than treating x and r here as flat 
spacetime coordinates. As an exercise one can show that with comoving 
coordinates, p and J will be multiplied by a 2 and that <fi becomes (f> + r/cj). 
The last step follows when one distinguishes time derivatives at fixed x 
from those at fixed ax. 

Are we to conclude that electromagnetism also violates causality, because 
the electric potential <f> depends only on the instantaneous distribution of 
charge? No! To understand this let us examine the Coulomb and Ampere 
laws in flat spacetime for the fields rather than the potentials: 

V E = V-E\\ = 4np , -d T E\\ = 4ttJ|| , VxB-d T E ± = 4irJ ± .(4.58) 

The Ampere law has been split into longitudinal and transverse parts. We 
see that the longitudinal electric field indeed is given instantaneously by 
the charge density. Because the photon is a massless vector particle, only 
the transverse part of the electric and magnetic fields is radiative, and its 
source is given by the transverse current density: 

(3 2 — V 2 ) B = 4ttV x J ± , (3 2 - V 2 ) E ± = -4ird T J ± . (4.59) 

But how does this restore causality? To see how, let us consider the 
following example. Suppose that there is only one electric charge in the 
universe and initially it is at rest in the lab frame. If the charge moves — 
even much more slowly than the speed of light — En — the solution to the 
Coulomb equation — is changed everywhere instantaneously. It must be 
therefore that E± also changes instantaneously in such a way as to exactly 
cancel the acausal behavior of _E7|| . 

This indeed happens, as follows. First, note that the motion of the charge 
generates a current density J = J» + «7j_. The longitudinal and transverse 
parts separately extend over all space (and are in this sense acausal) while 
their sum vanishes away from the charge (as do V • Jji a nd V x J±). The 
magnetic and transverse electric fields obey eqs. ( 4.59| ). Because J± is 



distributed over all space but V x Ji is not, retarded- wave solutions for 
B are localized and causal while those for E± are not. However, when 
E» is added to E±, one finds that the net electric field is causal (Brill & 
Goodman 1967). It is a useful exercise to show this in detail. 
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Now that we understand how causality is maintained, what is the use 
of the longitudinal part of the Ampere law, — d T E\\ = AttJ\\1 The answer 
is, to ensure charge conservation, which is implied by combining the time 
derivative of the Coulomb law with the divergence of the Ampere law: 

3 T p+V- J = d T p + V- Jy =0 . (4.60) 

Charge conservation is built into the Coulomb and Ampere laws. This 
remarkable behavior occurs because electromagnetism is a gauge theory. 
Gauge invariance effectively provides a redundant scalar field equation 
whose physical role is to enforce charge conservation. From Noether's the- 
orem (e.g., Goldstein 1980), a continuous symmetry (in this case, electro- 
magnetic gauge invariance) leads to a conserved current. 

General relativity is also a gauge theory. Coordinate invariance — a 
continuous symmetry — leads to conservation of energy and momentum. 
As a result there are redundant scalar and vector equations [eqs. ( 4.50) , 



(4.52), and (4.54)] whose role is to enforce the conservation laws [eqs. (4.24) 
and (4.25)]. We are free to use the action-at-a-distance field equations for 
the scalar and vector potentials in Poisson gauge because, when they are 
converted to fields and combined with the gravitational radiation field, the 
resulting behavior is entirely causal. 

The analogy with electromagnetism becomes clearer if we replace the 
gravitational potentials by fields. We define the "gravitoelectric" and 
"gravitomagnetic" fields (Thorne, Price & Macdonald 1986; Jantzen, 
Carini & Bini 1992) 

g = -Vip - d T w , H = V xw , (4.61) 



using the Poisson gauge variables ip and w. In section 4.8 we shall see 
how these fields lead to "forces" on particles similar to the Lorentz forces 
of electromagnetism. For now, however, we are interested in the fields 
themselves. 

Note that g and H arc invariant under the transformation ip — > ip — d, 
w — > w + Va. In the noncosmological limit (n = 0) this is a gauge 
transformation corresponding to transformation of the time coordinate (cf. 
eqs. 4.39 and 4.40). However, gauge transformations in general relativity 
are complicated by the fact that they change the coordinates and fields 
as well as the potentials. For example, the na terms in eq. (4.40) arise 
because the transformed metric is evaluated at the old coordinates. Thus, g 
should acquire a term rj Va under a true gauge (coordinate) transformation, 



which is incompatible with eq. (4.61). The actual transformation (ip 



ip — d, w — ► w + Va) is not a coordinate transformation. General relativity 
differs from electromagnetism in that gauge transformations change not 
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just the potentials but also the coordinates used to evaluate the potentials; 
remember that the potentials define the perturbed coordinates! Only in 
a simple coordinate system, such as Poisson gauge — the gravitational 
analogue of Coulomb gauge — is it possible to see a simple relation between 
fields and potentials similar to that of electromagnetism. 

In the limit of comoving distance scales small compared with the cur- 
vature distance |i4f| -1 / 2 and the Hubble distance r]~ 1 , and nonrelativistic 
shear stresses, the gravitoelectric and gravitomagnetic fields obey a gravi- 
tational analogue of the Maxwell equations: 

V • g = -4nGa 2 5p , V xg + d T H = , 

V • H = , V x H = -167rGa 2 /_i , (4.62) 

where / = (p+p)(v + w) is the momentum density in the normal (inertial) 
frame. (You may derive these equations using eqs. 4.49, 4.50, 4.53, and 



4.61.) These equations differ from their electromagnetic counterparts in 
three essential ways: (1) the sources have opposite sign (gravity is attrac- 
tive), (2) the transverse momentum density has a coefficient 4 times larger 
than the transverse electric current (gravity is a tensor and not a vector 
theory), and (3) there is no "displacement current" — d T g in the trans- 
verse Ampere law for V x H . Recalling that Maxwell added the electric 
displacement current precisely to conserve charge and thereby obtained 
radiative (electromagnetic wave) solutions, we understand the difference 
here: the vector component of gravity is nonradiative. Unlike the photon, 
the graviton is a spin-2 particle (or would be if we could quantize general 
relativity!), so radiative solutions appear only for the (transverse-traceless) 
tensor potential hij. In fact, the vector potential is nonradiative precisely 
because it is needed to ensure momentum conservation; mass conservation 
is already taken care of by the scalar potential. Recall the role of the ADM 



constraint equations discussed in section 4.4. Gravity has more conserva- 
tion laws to maintain than electromagnetism and consequently needs more 
fields to constrain. 

Obtaining this physical insight into general relativity is much easier in 
the Poisson gauge than in the synchronous gauge. This fact alone is a 
good reason for preferring the former. When combined with the other 
advantages (simpler equations, no time evolution required for the scalar 
and vector potentials, reduction to the Newtonian limit, no nontrivial gauge 
modes, and lack of unphysical coordinate singularities), the superiority of 
the Poisson gauge should be clear. 

Although the physical picture we have developed for gravity in anal- 
ogy with electromagnetism is beautiful, it is inexact. Not only have we 
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linearized the metric, we have also neglected cosmological effects in eqs. 



(4.62). We shall see in section 4.9 how to obtain exact nonlinear equations 



for (the gradients of) the gravitational fields. 
4-8. Hamiltonian dynamics of particles 

In this section we extend to general relativity the Hamiltonian formula- 
tion of particle dynamics that is familiar in Newtonian mechanics. In the 
process we shall obtain further insight into the physical meaning of the 
gravitational fields discussed in the previous section. A preliminary ver- 
sion of this material appears in (Bertschinger 1993). A related presentation 
in the context of gravitational fields near black holes is given by Thorne et 
al. (1986). 

As in the nonrelativistic case, we choose a Hamiltonian that is related 
to the energy of a particle. Consequently, our approach is not manifestly 
covariant; the energy depends on how spacetime is sliced into hypersurfaccs 
of constant conformal time r because the energy is the time component of 
a 4-vector. Nevertheless, our approach is fully compatible with general 
relativity; we must only select a specific gauge. For simplicity we sha ll 
adopt the Poisson gauge, eq. (4.11) with gauge conditions eq. ( 4.46p . 



We assume that the metric perturbations are given by a solution of the 
field eqs. (4.49)-(4.55). Our Hamiltonian will include only the degrees of 
freedom associated with one particle; one can generalize this to include 
many particles (even treated as a continuum) and the metric variables 
(Arnowitt et al. 1962; Misner et al. 1973; Salopek & Stewart 1992) but 
this involves more machinery than necessary for our purposes. 

The goal of the Hamiltonian approach is to obtain equations of motion 
for trajectories in the single-particle phase space consisting of the spatial 
coordinates x % and their conjugate momenta. The first question is, what 
are the appropriate conjugate momenta? This question practically answers 
itself when we express the action scalar in terms of our coordinates: 

S = j P^dx" = J (p t + Pi^jp) dr = J (-H + PiX 1 ) dr . (4.63) 

Note that we have automatically expressed the action in terms of the co- 
variant (lower-index) components of the 4-momentum (also known as the 
components of the momentum one-form). We can read off the Hamiltonian 
and conjugate momenta using the fact that S = J Ldr where L(x % , x J , t) 
is the Lagrangian, which is related to the Hamiltonian H(x l , Pj, r) by the 
Legendre transformation L = P^x % — H. The Hamiltonian therefore is 
H = —P T — despite appearances, we shall see that this is not in general 
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the proper energy — and the conjugate momenta equal the covariant spa- 
tial components of the 4-momentum. Indeed, we may simply define the 
conjugate momenta and Hamiltonian in this way. (Care should be taken 
not to confuse the Hamiltonian H with the Hubble parameter H and the 
gravitomagnetic field H\) 

With these definitions, H and Pi correspond to the usual quantities en- 
countered in elementary nonrelativistic mechanics, but we need not rely on 
this fact. For any choice of spacetime geometry and coordinates we may de- 
termine the corresponding Hamiltonian and conjugate momenta from the 
4-momentum components: for a particle of mass to, H = — mgo^dx 11 / dX, 
Pi = mgi^dx^ 1 1 'd\ where dX measures proper time along the particle 
trajectory. As an exercise, one may show that with cylindrical coordi- 
nates (r, 9, z) for a nonrelativistic particle of mass to in Minkowski space- 
time, P r = mr is the radial momentum, Pg = mr 2 6 is the angular 
momentum about e 2 , P z = mz is the linear momentum along e z , and 
H = E pa to + (P 2 + Pg/r 2 + P 2 )/2m is the proper energy (including the 
rest mass energy). We shall determine the functional form H(x l , Pj,r) for 
our perturbed Robertson- Walker spacetime below. 

First, however, let us show that our approach leads to the usual canonical 
Hamilton's equations of motion, rigorously justifying our choices H = —P T 
and Pi being the momentum conjugate to x 1 . To do this we simply vary 
the phase space trajectory {x 1 (t), Pj(r)} to {x l + Sx l , Pj + SPj}, treating 
5x l (r) and 8Pj{r) as independent variations and computing the variation 



of the action of eq. (4.63): 



, , dH ■ dH dx\^ d . , \ , 

dS = / ~-n Sx - ^w sp i + -i~ 5p i + p i~rte dr 

ox 1 oPi dr dr J 

dx* dH\ (dP % dH\ 



dr , (4.64) 



where we have assumed Pi8x l = at the endpoints of integration. Requir- 
ing the action to be stationary under all variations, SS — 0, we obtain the 
standard form of Hamilton's equations: 

dr dP, ' dr dx* ' [ ' 

Thus, Hamilton's equations give phase space trajectories in general rela- 
tivity just as they do in nonrelativistic mechanics. 

Our next step is to determine the Hamiltonian for the problem at hand. 
We shall assume that the particle falls freely in the perturbed Robertson- 
Walker spacetime described in the Poisson gauge. For comparison with the 
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nonrelativistic results, it is useful to relate the 4-momentum components 
to the proper energy and 3-momentum measured by a comoving observer 
(i.e., one at fixed x l ), E and pf. 



Pr 



i(l + ^j)E , Pi = a [(1 - <j>)(pi + E Wi ) + h l0 p>] 



(4.66) 



The first equation follows from E = — u M P M where u M is the 4-velocity of a 



comoving observer from eq. (1-21) with v = 0, while the second equation 
follows from projecting P^ into the hypersurface normal to u M and normal- 
izing to give the proper 3-momentum. The weak-field approximation has 
been made (i.e., terms quadratic in the metric perturbations are neglected), 
but the particle motion is allowed to be relativistic. The factors a(l + ip) 
and a(l — ip) are obviously needed from eq. (4.11) to convert proper quan- 
tities into coordinate momenta, the Ewi term arises because our space and 
time coordinates are not orthogonal if there is a vector mode, and the hijp 3 
term arises because our spatial coordinates are not orthogonal if there is 
a tensor mode. The reader may verify that the 4-momentum satisfies the 
normalization condition g^P^Py = —E 2 + p 2 = —to 2 , and that this con- 
dition would be violated in general without the vector and tensor terms in 
Pi- 

Using these results it is easy to show that, to first order in the metric 
perturbations, the Hamiltonian is 

1/2 



H{x\P h r) 



(1 



)P - ew - h 



PI 2 + a 2 m 2 



where 



(P 2 + a 2 m 2 



iV2 



+ eV, (4-67) 



(4.68) 



and the squares and dot products of 3- vectors such as Pi, Pi, and h^Pj are 
com puted using the 3 -metric, e.g., P 2 — 7 y PiPj. Using the Hamiltonian of 
eq. ( P~67l ), eqs. ( ^65|) may be shown to be fully equivalent to the geodesic 
equations for a freely falling particle moving in the metric of eq. (4.11), 
and they could also be obtained starting from a Lagrangian approach. 
The advantage of the Hamiltonian approach is that it treats positions and 
conjugate momenta equally as is needed for a phase space description. 

Equation (4.67) appears strange at first glance. To understand it better, 
let us recall the standard form for the Hamiltonian of a particle with charge 
e in electromagnetic fields (with <j) being the electrostatic potential): 

1 1/2 

n 1 



H e (x%Pj,t) 



+ eq 



(4.69) 



Note that the proper moment um is p = P — e A where P is the conjugate 
momentum. Comparing eqs. (4.67) and ( 4.69| ), we see that they are very 
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similar aside from the tensor term h • P present in the gravitational case. 
The few remaining differences are easily understood. To compensate for 
spatial curvature — effectively a local change of the units of length — in 
the gravitational case P is multiplied by (1 + 0). The electric charge e 
is replaced by the gravitational charge e (energy!); to zeroth order in the 
perturbations e = H = aE. The use of comoving coordinates is respon- 
sible for the factors of a(r). The gravitational (gravitomagnetic) vector 
potential is w — as we anticipated in eq. (4.61). Finally, the electrostatic 
potential energy e<p is replaced in the gravitational case by eip. The strong 
analogy between the vector mode and magnetism accounts for the adjective 
"gravitomagnetic . " 

A different interpretation of the gravitomagnetic contribution to the 
Hamiltonian will clarify the relati on o f gravitomagnetism and the drag- 
ging of inertial frames. In section 4.3 we noted that w is the velocity of 
the comoving frame relative to a locally inertial frame (the normal frame). 
For w 2 -C I, p' = p + Ew is therefore the proper momentum in the nor- 
mal frame. According to eq. ( 4.66 ), then, neglecting the scalar and tensor 
modes, P is the comoving momentum (i.e., multiplied by a) in the normal 
frame, P = ap' , while P ew (the combination present in the Hamilto- 
nian) is the comoving momentum in the comoving frame. It is logical that 
the Hamiltonian should depend on the latter quantity; after all, we are us- 
ing non-orthogonal comoving spacetime coordinates. However, it is equally 
reasonable that the conjugate momentum should be measured in the frame 
normal to the hypersurface r = constant. Thus, it is simply the offset 
between these two frames — if one likes, the dragging of inertial frames — 
that is responsible for the — ew term in eq. ( 4.67| ). Gravitomagnetism — 
and similarly magnetism, if one interprets (e/m)A as a velocity — can be 
viewed as a kinematical effect! 

The tensor mode, corresponding to gravitational radiation, gives an extra 
term in the Hamiltonian — really in the relation between the proper and 
conjugate momenta — that is not present in the case of electromagnetism. 
Geometrically, h corresponds simply to a local volume-preserving defor- 
mation of the spatial coordinate lines, and in this way it simply extends 
the effect of the spatial curvature term cf>P in eq. ( 4.67 ) {<j> represents an 
orientation-preserving dilatation of the coordinate lines). However, what 
is more important is the dynamical effect of these terms, neither of which 
is familiar in either Newtonian gravity or electromagnetism. 

To study the dynamics of particle motion we use Hamilton's eqs. 



with the Hamiltonian of eq. (4.67). In terms of the proper momentum 



p measured by a comoving observer, Hamilton's equations in the Poisson 
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gauge become 



1/2 



dx d r 

_ = (l + ^ + 0_h.)^, £'e[| p + ^ 

[a (1 - 0+ h-)p] = e [g + v x if - v 2 Vcf) + v i v j Vh ij )] - ew , 

(4.70) 

where we have defined E' to be the proper energy in the normal frame, v 
is the peculiar velocity (in the weak-field limit it doesn't matter whether it 
is the coordinate or proper peculiar velocity nor whether it is measured in 
the comoving or normal frame) and g and H are the gravitoelectric and 



gravitomagnetic fields given by eqs. (4.61). The dot following h indicates 



the three-dimensional dot product, with h • p being a 3- vector. 

Equations (4.70) appear rather complicated at first but each term can 
be understood without much difficulty. First, note that the factor (1 + 
tp + <p — \n-) in the first equation is present solely to convert from a proper 
velocity to a coordinate velocity dx/dr according to the metric eq. (4.11). 
Using the transformation from the normal (primed) to comoving frame, 
p = p' — Ew ps p' — E'w, the equation for dx/dr implies that the proper 
velocity in the comoving frame must equal p/E' = p'/E' — w. This is 
identically true because p'/E' is the proper velocity in the normal frame, 
whose velocity relative to the comoving frame is —w. 

Similarly the factor a(l — + h-) in the momentum equation simply con- 
verts the proper momentum p to the comoving momentum in the comoving 



frame, P — ew (cf. eq. 4.66). The first two terms on the right-hand side 
have exactly the same form as the Lorentz force law of electrodynamics, 
with the electric charge e replaced by the comoving energy e and the electric 
and magnetic fields E and B replaced by their gravitational counterparts 
g and H. Thus, general relativity in the weak-field limit gives "forces" on 
freely-falling bodies (when expressed in the Poisson gauge) that are very 
similar to those of electromagnetism! 

The remaining terms in the momentum equation have no counterpart in 
electrodynamics or Newtonian gravity. There are two gravitational force 
terms quadratic in the velocity arising from spatial curvature. The first one 
is present for a scalar mode and is responsible for the fact that photons are 
deflected twice as much as nonrelativistic particles in a gravitostatic field 
(<f> = ip in the Newtonian limit). The second term represents, in effect, 
scattering of moving particles by gravitational radiation. A gravity wave 
traveling in the z-direction will accelerate a particle in this direction if the 
particle has nonzero velocity in the x-y plane (the direction of polarization 
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of the transverse gravity wave). If the particle is at rest in our coordinate 
system, it remains at rest when a gravity wave passes by. However, because 
the gravity wave corresponds to a deformation of the spatial coordinate 
lines, the proper distance between two particles at rest in the coordinate 
system does change (Misner ct al. 1973). 

Finally, the last term in the momentum equation, —e w, represents a sort 
of cosmic drag that causes velocities of massive particles to tend toward zero 
in the normal (inertial) frame (by driving p toward —Ew). The timescale 
for this term (the time over which e changes appreciably) is the Hubble 
time, so it should not be regarded as the frame dragging normally spoken 
of when loosely describing the vector mode. In fact, in the normal frame 
this term is absent, but then the gravitomagnetic term changes from evxH 
to eV(w? ■ v). The relative velocity of the comoving and normal (inertial) 
frames w is responsible for the frame-dragging and other effects; let us 
consider a particularly interesting one. 

In general, w varies with position so that at different places the iner- 
tial frames rotate relative to the comoving frame with angular velocity 
— JjV x w = —\H-, this is easily shown from a first-order Taylor series 
expansion of w with the constraint V • w = 0. As a result, a spin S will 
precess relative to the comoving frame at a rate dS/dr — — \H x S (the 
Lense-Thirring effect). Using the magnetic analogy, one would predict a 
gravitomagnetic precession rate jS x H in the comoving frame, where 7 is 
the gyrogravitomagnetic ratio. (The analogous magnetic precession rate is 
(ixB, where fi — jS.) Note that this result leads to the conclusion that 
there is a universal gyrogravitomagnetic ratio 7=5! 

Thus, one may interpret the vector mode perturbation variable w either 
as a source for (rather mysterious) frame-dragging effects, or as a vector 
potential for the gravitomagnetic field H. In the former case one can elim- 
inate w altogether by choosing orthogonal space and time coordinates such 
as given by the synchronous gauge. However, I prefer the latter interpre- 
tation because of the close analogy it brings to electrodynamics, allowing 
us to transfer our flat spacetime intuition to general relativity. The price 
to pay is that one must be careful to distinguish the comoving and normal 
frames. 

We have discussed the gravitomagnetic and gravitational wave contribu- 
tions to the equations of motion in order to illustrate the similarities and 
differences between gravity and electrodynamics. (They are clearest in the 
Poisson gauge; the interested reader may wish to rederive the results of this 
section in synchronous or some other gauge.) Why aren't we familiar with 
these forces in the Newtonian limit? The answer is because the sources 
of H and h are smaller than the source of the "gravitostatic" field — VV> 
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by 0(v/c) and 0(v/c) 2 , respectively (cf. eqs. 4.62 and 4.55). From eqs. 
(4.70), the forces they induce are smaller by additional factors of 0(v/c) 
and 0(v/c) 2 . Thus, for nonrclativistic sources and particles, the dynami- 
cal effects of gravitomagnetism and gravitational radiation are negligible. 
While ordinary magnetic effects are suppressed by the same powers of v /c, 
the existence of opposite electric charges leads in most cases to a nearly 
complete cancellation of the electric charge density but not the current 
density. No such cancellation occurs with gravity because energy density 
is always positive. 

Since typical gravitational fields in the universe have ip w </> ~ 10 -5 
and hij is much smaller than this, the curvature factors (1 + ip + <fi — h) 
and (1 — <j> + h) may be replaced by unity to high precision in eqs. (4.70) 
(and they are absent anyway in locally flat comoving coordinates). In the 
weak-field and slow-motion limit, then, eqs. (4.70) reduce to the standard 
Newtonian equations of motion in comoving coordinates. 

4-9. Lagrangian field equations 

General relativity makes no fundamental distinction between time and 
space, although we do. To obtain field equations that are similar to those 
of Newtonian gravity and electrodynamics, we have until now employed a 
"3+1 split" of the Einstein and energy conservation equations. Ellis (1971, 
1973), following earlier work of Ehlers (1961, 1971), Kundt & Trumpcr 
(1961), and Hawking (1966), has developed an alternative approach based 
on a "1+3 split" of the Bianchi and Ricci identities. The cosmological 
applications have been developed extensively by Ellis and others in recent 
years (Ellis & Bruni 1989; Hwang & Vishniac 1990; Lyth & Stewart 1990; 
Bruni, Dunsby & Ellis 1992; and references therein). Ellis' approach has 
some important advantages, as we shall see. 

The 3+1 split corresponds to the "slicing" of spacetime into a series of 
spatial hypersurfaces, each labeled by a coordinate time r. (The different 
splitting procedures are most easily visualized with one spatial dimension 
suppressed using a 2+1 spacetime diagram, with time corresponding to the 
vertical axis. The spatial hypersurfaces are then horizontal slices through 
spacetime.) Spacetime is described by Eulerian observers sitting in these 
hypersurfaces with constant spatial coordinates. 

The 1+3 split, called "threading," is complementary to slicing (Jantzen 
et al. 1992). In this case the fundamental geometrical objects used for 
charting spacetime are a series of timelike worldlines x M (A;q), where A is 
an affine parameter measuring proper time along the worldline and q gives 
a unique label (e.g., a spatial Lagrangian position vector) to each different 
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worldline (or "thread"). In this case spacetime is described by Lagrangian 
observers moving along these worldlines. 

The threading description is more general than the slicing one. If we 
take the threads to correspond to the worldlines of comoving observers in 
the slicing framework (lines of fixed x), then the two descriptions are the 
same. In the 1+3 description, however, different threads may cross with no 
harmful consequences while in the 3+1 description a spatial hypersurfacc 
must not be allowed to cross itself or other slices. Thus, the threading 
description may be used to follow the evolution of cold dust beyond the time 
when matter trajectories intersect, when the perfect-fluid Eulcr equations 
break down. The advantage of a Lagrangian description is well known 
for collisionlcss matter — the Lagrangian approach exclusively is used for 
nonlinear gravitational simulations — and the same advantages accrue even 
when describing the spacetime geometry itself. 

In the 1+3 approach each worldline threading spacetime has a time- 
like unit tangent vector (4-velocity) = dx^/dX = u^(X;q) such that 
u^u^ = — 1. Spacetime tensors are then decomposed into parts parallel 
and normal to the worldline passing through a given point. This decompo- 
sition is accomplished in a covariant form using the tangent vector and 
the orthogonal projection tensor 

P^(u) = g^ v + u^u v , (4.71) 

such that P l _ ll/ u v — and P^ K P KV = P^ v . P^ v is effectively the spatial 
metric for observers moving with 4-velocity (Ellis 1973). We may use it 
and w M to split any 4-vector A 11 into timelike and spacelike parts, labeled 
by the tangent vector of the appropriate thread: 

A(u) = -u^A" , A»(u) = P» V A V . (4.72) 

Even though A^(u) looks like (and is, in fact) a 4-vector, we can regard it 
as a 3- vector in the rest frame of an observer moving along the worldline 
x fJ, (X;q) because u li A ,t (u) = 0. [Note that A^ denotes the original 4- 
vector while A^{u) denotes its projection normal to u^. We shall include 
the argument (u) for the projection whenever needed to remove ambiguity] 
We require that at each point in spacetime there is at least one thread with 
corresponding tangent u M (A; q). If there are several threads then there are 
several different decompositions of A(u) and A^{u) at each labeled by 
q (implicitly, if not explicitly) through u^(X;q). This causes no problems 
as long as we refer to a single distinct thread, which we do by retaining u 
in the argument list. 
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The decomposition of a second-rank tensor T^ v is similar: 

T{u) = u^u v T^ , T M (u) = g^T v (u) = -P^u a T a » , 

T"„(v) = P» a P v pT a P . (4.73) 
As an exercise one may apply this decomposition to the stress-energy tensor 



of eq. (4.19) using the comoving observers to define the threading. For 
v 2 <C 1, one obtains nonzero elements T — p, TJ = a(p + p)vi (with no Wi), 
and T l = pS 1 , + S 1 -. Be careful to distinguish the 4- velocity of the threads 



(with v l = 0) from those of the matter (eq. 4.22). 

Now that we have described the 1+3 spacetime splitting procedure, we 
are ready to apply it to gravity following Hawking (1966) and Ellis (1971, 
1973). What equations should we use? One might think to split the Ein- 
stein equations using 1+3 threading, but this does not add anything funda- 
mentally new to what we have already done. The correct approach suggests 
itself when we think in Lagrangian terms following a freely-falling observer, 
whose worldline defines one of the threads. Such an observer feels no grav- 
itational force at all but does notice that adjacent freely-falling observers 
do not necessarily move in straight lines with constant speed. In Newto- 
nian terms this is explained by "tidal forces" while in general relativity it 
is called geodesic deviation. We shall not present a derivation of geodesic 
deviation here (one may find it in any general relativity textbook) but sim- 
ply note that it follows from the non-commutativity of covariant spacetime 
derivatives of the 4-velocity. The relevant equation is the 4-dimensional 
version of the first of eqs. (4.5), called the Ricci identity: 

[V K ,Vx]u» = R» VKX u v . (4.74) 

This identity holds for any differentiable vector field . In the Lagrangian 
field approach we seek evolution equations for the Riemann tensor itself 
rather than the metric tensor components. 

One advantage of working with the Riemann tensor is the fact that part 
of it — the Ricci tensor — is given algebraically by the local stress-energy 



through eqs. ( [4.7j ) and (4.S). However, one cannot (in 4 dimensions) recon- 
struct the entire Riemann tensor from the Ricci tensor alone. One could 
obtain it by differentiating the metric found by solving the Einstein equa- 



tions (cf. eqs. [4.9| , 4.10). As we shall see, there is another method that 
does not require integrating the Einstein equations. 

This alternative method is based on an evolution equation for that part 
of the Riemann tensor that cannot be obtained from the Ricci tensor, the 
Weyl tensor C^ K y. 
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Cfj,i//^x — R(j,ukX 2 ^Si^k^vX ~h 9vxR[ik, 9ii\Rvk, QvK,R[lX) 
R 

+-q (9hk9v\ - 9iiX9vk) ■ (4-75) 

This tensor obeys all the symmetries of the Riemann tensor — C^ VKl \ = 
C\p.v][nX\ = C K \^ V and C^^a] = (where square brackets denote antisym- 
metrization) — and in addition is traceless: C K „ KV — 0. Thus, the trace 
part of the Riemann tensor is given by the Ricci tensor i? M „ (through the 
Ricci terms on the right-hand side of eq. 4.75) while the traceless part is 
given by the Weyl tensor. Physically, the Ricci tensor gives the contribu- 
tion to the spacetime curvature from local sources (through the Einstein 
eqs. 4.7 combined with 4.8) while the Weyl tensor gives the contribution 
due to nonlocal sources. It is clear that Newtonian tidal forces will be 
represented in the Weyl tensor. It may be shown that in 4 dimensions the 
Ricci and Weyl tensors each have 10 independent components. 

How do we get an evolution equation for the Weyl tensor? The Einstein 
equations will not do because the Weyl tensor makes no appearance at 
all in the Einstein tensor. The correct method, due to Kundt & Trumper 
(1961), makes use of the Bianchi identities, 

V a R^ K \ + V^RuanX + V„R^ K x = . (4.76) 

These identities follow directly from the definition of the Riemann ten- 
sor (see any general relativity or differential geometry textbook). For our 
purposes the key point is that the y pro vide differential equations for the 
Riemann tensor. Contracting eq. ( 4.76|) on k and a and using eqs. (4.75) 
and (4.8), we get 

V K C^ KX = V^G^x + i gx {i y v] G\ . (4.77) 

Note that if we contract now on A and /x, using the symmetry of and 
g^v w e get V^G^ 1 ^ = 0, as noted before. However, here we regard eq. 
(4.77) as an equation of motion for the Weyl tensor. Using the Einstein 
eqs. (4.7), we see that the source is given in terms of the energy-momentum 
tensor, so 

V K C^ KX = SttG (v^T^ + ^g^V^T^ . (4.78) 

The next step is to split the Weyl tensor into two second-rank tensors 
using a 1+3 threading of spacetime (Hawking 1966, Ellis 1971), 

(u) = u K u x C^ vX , (u) = \ e a p K{ll u K u x C aP y)x . (4.79) 
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We have used the fully antisymmetric tensor e M „ K >, = (— g) 1//2 [/ivkX], where 
g is the determinant of g M „ and [[ivkX] is the completely antisymmetric 
Levi-Civita symbol defined by three conditions: (1) [0123] = +1, (2) [/iukX] 
changes sign if any two indices are exchanged, and (3) [/wkA] = if any 
two indices are equal. (Note that Ellis uses the tensor TjuvKX — —e^ VKl \. We 
have compensated for the sign change in defining H^ u . Beware that (^ vkX = 
— (— g) -1 / 2 [iiukX].) The two new tensors £^„ and H^ v are both symmetric 
(ii^u must be explicitly symmetrized), traceless, and flow-orthogonal, i.e., 
E^u" = H^u" = and P v K E ilv = E^, P\H^ = H^. Therefore 
and Hftv each has 5 independent components, half as many as the Weyl 
tensor. Indeed, the Weyl tensor is fully determined by them for non-null 
threads: 

C^kX — (g^vaP 9k\jS — ^tivaP ^kXjs) U a U' ' E l3i \u) 

+ [tilvaP 9k\ 1 S + g^ali e-nX 1 s) !l°II 7 lf' 3i (ll) , (4.80) 

where g^ a p = g^g^p - g^va = ~h e l ,„ K,Xe K.x a = g[nv][ap\ = 9a/3^, with 



9^{vai3] — 0. Eq. (4.80) is the inverse of eqs. ( |4.79| ) provided g p _ v u^u v = ±1. 
Ellis (1971) has a sign error in the first term of his version of eq. (4.80) at 
the end of his section 4.2.3. 

The tensors E^(u) and H^„(u) are called the electric and magnetic parts 
of the Weyl tensor, respectively. Together with the Ricci tensor they fully 
determine the spacetime curvature for a given threading (i.e., a system of 
threads with tangent vectors) u^(X;q). It is worth noting that, if there 
are several threads at a given spacetime point, -E^^u) and H^ u (u) have 
different values for each thread, and so they may be considered Lagrangian 
functions: E^ ly (X;q) and H tlv (X;q). The Weyl tensor components are, 
however, unique, with the same value for all threads passing through the 
same spacetime point. This condition is satisfied automatically if the same 



4-velocity is used in both eqs. (4.79) and (4.80). 

Our goal is to rewrite eq. (4.78) in terms of E^ v and if M „. Because the 



results involve the covariant derivative of the 4-velocity field V^it,,, we first 
decompose this quantity into acceleration, expansion, shear, and vorticity: 

V M u„ = + P^P^V a u/3 = -u^ + i 0P M „ + <j [_iu + uJn U ; 

9 = V^i/ 1 , = a-( pw ) , w M „ = u>[^] = e^apu ^ 13 . (4.81) 

We have introduced the covariant derivative in the direction it", D/dX = 
h"V„. Since this is just the proper time derivative along the worldlinc, 
a„ = Du v /dX is the 4-acceleration. The flow-orthogonal part of the velocity 
gradient, P^P^V ' a v,p, has been decomposed into the expansion scalar 
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0, the traceless shear tensor cr^, and the vorticity tensor u>^ v or its flow- 
orthogonal dual, uj^. Note that the expansion scalar includes a contribution 
due to cosmic expansion in addition to the peculiar velocity: neglecting 
metric perturbations, 9 = a -1 ^ + V • v). Note also that in the fluid rest 
frame, w'e^ = -|V x v is half the usual three-dimensional vorticity. (Ellis 
defines uj^ and w M with the opposite sign to us.) 

We shall apply this gradient expansion to the tangent field of the 1+3 
spacetime threading. This requires that be differentiable, which will be 
true (almost everywhere) if it corresponds to the 4-velocity field of a flow. 
In a frame comoving with the fluid, O, aij and u>ij arc then the usual fluid 
expansion, shear, and vorticity, respectively. 

By projecting V K C M „ K >, with various combinations of u a and P a/3 (u) 
(these are dependent on the spacetime threading), one can derive the fol- 
lowing identities: 

u"u x V k C^ kX = P» a P» V v E afJ + u v a ai H\ - 3H^ , (4.82) 

-3E» u u u , (4.83) 

r) zpa{3 

P^P^V K C aPKX = P^ P ^__ + p^ e ^ Up V y H aS 

+2u a apH^e v)a ^ + QE^ + P^ {a af} E a!i ) 
-2E av {o\ - <,) - E"»(a\ + W "J , (4.84) 

^ a P uX u e a ^ s V K C l5K x = -P*J*™^- + P^e^upV^s 

+2u a a p E 1 ^e u)a ^ - BH^ - P» u {a afi H af} ) 

+2H^(a u a - u> v a ) + H av {a\ + w"J . (4.85) 

These identities follow from eqs. (4.80) and (4.81). All quantities on the 
right-hand sides are to be evaluated for a given thread it M (A; q). 

Finally we are ready to obtain equations of motion for the electric and 



magnetic parts of the Weyl tensor from eq. ( 4.78 ). In fact, infinitely 
many sets of equations are possible because are free to use any spacetime 
threading! For example, we may choose Eulerian threading with q = x, 
in which case in the Poisson gauge we have u° = a (1 — ip) and u l — 0, 
so that D/d\ = a _1 (l — ip)d r is the Eulerian proper time derivative. In 
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this case the 1+3 split coincides with our previous 3+1 split. The Eulerian 
description is not covariant, for it depends on our choice of gauge. Because 
the Weyl tensor formalism is more complicated than our previous treatment 
based on the Einstein equations, there is no clear advantage to its use with 
Eulerian threading. 

If, however, we use the fluid velocity itself — the appearing in eq. 
(4.19), which is well-defined even for an imperfect or collisionless fluid — 
to define the threading, then the Weyl tensor approach becomes more at- 
tractive. This choice corresponds to Lagrangian threading: the threads are 
the worldlines of fluid elements, so that D/dX now is the proper time deriva- 
tive measured in the fluid rest frame. There are two important advantages 
to this choice. First, it is covariant: the fluid worldlines define a unique 
spacetime threading with no gauge ambiguities (Ellis & Bruni 1989), while 
any coordinates may be used to express the tensor components and 
H^ v . Second, the right-hand side of eq. ( 4.78 ) — the source for the Weyl 
tensor — is expressed in terms of the same 4- velocity used in the threading, 
greatly simplifying the projections appearing in eqs. ( 4.82 )-(4.85). 

Ellis (1971) and Hwang & Vishniac (1990) give the Lagrangian gravita- 
tional field equations for a general stress-energy tensor. For a perfect fluid 
(with = in eq. |4.19|) the results are 

(div-£) : P» a P v p V v E a P + e^Uva^lTp - 3H» v u v 

= y GP» v V vP , (4.86) 

(H) : pn a p» p —^- - P a ^e^ 5 upV^E a5 

-2u a a p E 7 ^e^ al3 ~< + QH»" + P^{a al3 H a0 ) - 3H a ^a^ a 
+H a ^u"\ = , (4.87) 
(div-ff) : P^ a P u p V u H aP - ^Uua^BTp + 3E» v u>" 

= -8wG{p + p)^ , (4.88) 

DE a P 

+2u a apH^e w)a ^ + + P^{a a0 E al3 ) - 3E a ^a v) a 

+E a ^oj^ a = -4vrG( / 9 + p)a^ . (4.89) 

These hav e bee n obtained by substituting eqs. ( 4.19| ) and ( 4.82 )-(4.85) 
into eq. ( 4.78 ), and using V ' v T^ iV = to simplify the right-hand sides 
of the div--E and E equations. The results agree with eqs. (4.21) of Ellis 



(E) : P» a P» —— + P a ^e^ s upV^H aS 
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(1971). For an imperfect fluid it is necessary to add terms to the right-hand 
sides involving the shear stress T,^ . For a pressureless fluid (e.g., cold dust 
before the intersection of trajectories) the 4-acceleration ap vanishes. 

In his beautifully lucid pedagogical articles presenting the Lagrangian 
fluid approach, Ellis (1971, 1973) has noted the similarity of eqs. (4.86)— 
(4.89) to the Maxwell equations, particularly if the covariant form of the 
latter are split using 1+3 threading. Compare them with eqs. (4.62) for 
the vector (not tensor) gravitational fields in the Poisson gauge. Although 
the latter equations are more reminiscent of the Maxwell equations in flat 
spacetime, they are only approximate (they are based on a linearized met- 
ric and neglect several generally small terms), they are tied to a particular 
coordinate system (Poisson gauge), and they do not incorporate gravita- 
tional radiation. By contrast, eqs. (4.86)-(4.89) are exact, they are valid 
in any coordinate system (all quantities appearing in them are spacetime 
tensors), and they include all gravitational effects. The exact equations in- 
volve second-rank tensors rather than vectors because, in the terminology 
of particle physics, gravity is a spin-2 rather than a spin-1 gauge theory. 

The quasi-Maxwellian equations (4.86)-(4.89) show that the evolution 
of the Weyl tensor depends on the fluid velocity gradient. This quantity 
could be computed by evolving the equations of motion for the matter 



(e.g., eqs. 1.24 and 4.25) to get the velocity field u^{x) and then taking 
its derivatives. However, there is a more natural way in the context of 
the Lagrangian approach: integrate evolution equations for the velocity 
gradient itself. In fact, such equations follow simply from projecting the 
Ricci identity (4.74) for the fluid velocity it M with u K P aX Pp^ and separating 



the result as in eqs. (4.81). It is straightforward to derive the following 
equations (Ellis 1971, 1973): 



^T- V ^ + 3 



- V M a" + - 6 2 + a^a^u - 2lu 2 = -4irG(p + 3p) , (4.90) 



+ \ ^ a0 u v V a a [3 + | 9^ - <r» v u v = , (4.91) 



dX 3 



1 p„v ( a ^ aafj +UJ 2_ = _ Eii v ^ ( 4 92 ) 

o 



where io 2 = lo^oj^ . Equation ( 1.9C ) is known as the Raychaudhuri equation. 



It shows that the expansion is decelerated by the shear and by the local 
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density and pressure (if p + 3p > 0) , but is accelerated by the vorticity. 



Vorticity, on the other hand, is unaffected by gravity; eq. (4.91) implies that 
vorticity can be described by field lines that (if a M vanishes or if the fluid has 
vanishing shear stress) are frozen into the fluid (Ellis 1973). Finally, shear, 
being the traceless symmetric part of the velocity gradient tensor, has as its 
source the electric part of the Weyl tensor. These equations are essentially 
identical to their Newtonian counterparts (Ellis 1971; Bertschinger & Jain 
1994). Note that the magnetic part of the Weyl tensor does not directly 
influence the matter evolution. 

Closing the Lagrangian field equations also requires specifying the evolu- 
tion of density and pressure (and shear stress, if present). These follow from 
energy conservation, V„T^= 0, combined with an equation of state. For 
a perfect fluid, using eq. (|4.19|) with = and projecting the divergence 
of the stress-energy tensor with gives 

^ + (p + P )e = . (4.93) 

Equations (4.86)- j4.93f) now provide a set of Lagrangian equations of mo- 
tion for the matter and spacetime curvature variables following a mass 
element. These Lagrangian equations of motion offer a powerful approach 
to general relativity — and to relativistic cosmology and perturbation the- 
ory — that is quite different from the usual methods based on integration 
of the Einstein equations in a particular gauge (or with gauge-invariant 
variables) . 

To relate the relativistic Lagrangian approach to dynamics to the stan- 
dard Newtonian one, we now evaluate the electric and magnetic parts of 
the Weyl tensor in the weak- field, slow- motion limit. They involve second 
derivatives of the metric and not simply the first derivatives present in eqs. 



(4.61). In the Poisson gauge, to lowest order in the metric perturbations 



and the velocity, from eqs. (4.79) one obtains (Bertschinger & Hamilton 
1994) 

1 1 1 •• 2 

Hij = -\ V {l H 0) + e kl[l V k h 3 j , (4.94) 



where Hj is the gravitomagnetic field defined in eq. ( 4.61 ). The time- 
time and space-time components of E^ u and vanish in the fluid frame 
because these tensors are flow-orthogonal. 

Do these results imply that in the Newtonian limit fly = and By = 
Dij(j> is simply the gravitational tidal field? If we say that the Newtonian 
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limit implies ip = <f> and Wi = hij = (no relativistic shear stress, no 
gravitomagnetism, and no gravitational radiation), then the answer would 
appear to be yes. This possibility, considered by Matarrese, Pantano, & 
Saez (1993) and Bertschinger & Jain (1994), has an important implication: 
for cold dust, the Lagrangian evolution of the tidal tensor obtained from 
eq. (4.89) would then be purely local (Barnes & Rowlingson 1989). That 
is, the evolution of the tide (the electric part of the Weyl tensor) along 
the thread u M (A;q) would depend only on the density, velocity gradient, 
and tide defined at each point along the trajectory with no further spatial 
gradients (since they arise only from the magnetic terms in eq. 4.89). The 
evolution of the density and of the velocity gradient tensor are clearly local 



(eqs. 4.9C - 1.95 , with a M = 0) aside from the tidal tensor, but we have just 
seen that its evolution depends only on other local quantities. In other 
words, if H.^ = 0, the matter and spacetime curvature variables would 
evolve independently along different fluid worldlines. Bruni, Matarrese, 
and Pantano (1994) call this a "silent universe." 

Local evolution does occur if the metric perturbations are one-dimensional 
(e.g., the Bondi-Tolman solution in spherical symmetry, or the Zel'dovich 
solution in plane symmetry; see Matarrese et al. 1993 and Croudace et al. 
1994), but it would be surprising were this to happen for arbitrary matter 
distributions in the Newtonian limit. 

Bertschinger & Hamilton (1994) and Kofman & Pogosyan (1995) have 
shown that, in fact, the general evolution of the tidal tensor in the New- 
tonian limit is nonlocal. The reason is that, while one may neglect the 
metric perturbation wi in the Newtonian limit, its gradient should not be 
neglected. Doing so violates the transverse momentum constraint equation 
(4.51), unless the transverse momentum density (the source term for w 
in the Poisson gauge) vanishes. This condition does not hold for general 
motion in the Newtonian limit. 

A convincing proof of nonlocality is given by the derivation of eq. (4.89) 
in locally flat coordinates in the fluid frame by Bertschinger & Hamilton 
(1994) using only the Newtonian continuity and Poisson equations plus the 
second pair of eqs. (4.62) and a modified form of eq. (4.94): 



1 



H ij = - g V ii H i) - 2v * e (i E i)i + 0{v/cf . (4.95) 

This is taken as the definition of in the Newtonian limit (where we 
also have Eij = Dij(f>). Note that in the Newtonian limit we neglect grav- 
itational radiation, but we must include terms that are first-order in the 
velocity. Even though we define the magnetic part of the Weyl tensor us- 
ing the fluid 4-velocity, we are evaluating its components in a particular 
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gauge — Poisson gauge — in which the 3-velocity does not necessarily 
vanish. The extra term in eq. (4.95) arises from evaluating eqs. (4.79) to 
first order in v/c (Bertschinger & Hamilton 1994) and it is analogous to 
the Lorentz transformation of electric fields into magnetic fields in a mov- 
ing frame. Both terms in eq. (4.95) are of order Gpv. They can not be 
neglected in the Newtonian limit. 

The implication of this result is that Lagrangian evolution of matter 
and gravity is not purely local except under severe restrictions such as 
spherical or plane symmetry. There exist, of course, local approximations to 
evolution such as the Zel'dovich (1970) approximation. Finding improved 
local approximations is one of the active areas of research in large-scale 
structure theory. Formulating the problem in terms of the Lagrangian fluid 
and field equations not only may suggest new approaches, it is also likely 
to clarify the relation between general relativity and Newtonian dynamics. 
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